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APPELLANTS' BRIEF 

MAIL STOP APPEAL BRIEF - PATENTS 

Commissioner for Patents 
P.O. Box 1450 

Alexandria, Virginia 32613-1450 



Dear Sir: 

This Appeal Brief, filed in connection with the above captioned patent application, is 
responsive to the Final Office Actions mailed on December 27, 2007. A Notice of Appeal was 
filed on May 27, 2008. The following constitutes the Appellants 1 Brief on Appeal, submitted 
with a request for a two month extension of time with the requisite fee. As the two month 
extension deadline for response fell on a Saturday, this brief is timely filed on the following 
business day September 29, 2008. 

Appellants hereby appeal to the Board of Patent Appeals and Interferences from the final 
rejection in this case. 
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I. REAL PARTY IN INTEREST 

The real party in interest is Genentech, Inc., South San Francisco, California, by an 
assignment of the parent application, U.S. Patent Application Serial No. 09/665,350 recorded 
July 9, 2001, at Reel 01 1964 and Frame 018L The present application is a continuation of U.S. 
Patent Application Serial No. 09/665,350. 

II. RELATED APPEALS AND INTERFERENCES 

The claims pending in the current application are directed to a polypeptide referred to 
herein as "PR0232." Although there exist several applications directed to the "gene 
amplification" utility under Appeal, there are no applications related to PR0232 nucleic acids or 
antibodies. 

III. STATUS OF CLAIMS 

Claims 44-46 and 49-5 1 are in this application. 
Claims 1-43 and 47-48 have been canceled. 

Claims 44-46 and 49-51 stand rejected and Appellants appeal the rejection of these 

claims. 

A copy of the rejected claims in the present Appeal is provided in the Claims Appendix. 

IV. STATUS OF AMENDMENTS 

A summary of the prosecution history for this case is as follows: 

Previously, in response to a Final Office Action mailed on October 21, 2005, a Notice of 
Appeal was filed on February 21, 2006 and an Appeal Brief was filed on April 17, 2006. An 
RCE Response with additional references and affidavits supporting Appellants' arguments was 
filed on July 27, 2006. A Final Office Action was mailed on October 17, 2006, and a Notice of 
Appeal was filed on March 19, 2007. An Appeal Brief was filed on September 24, 2007. 

Upon further consideration, the Examiner withdrew the finality of the previous Office 
Action solely to clarify the issues for appeal as outlined in a Final Office Action mailed 
December 27, 2007. A Notice of Appeal was filed on May 27, 2008. 

No claim amendments have been submitted after the last final rejection of October 17, 
2006. A copy of the rejected claims in the present Appeal is provided in the Claims Appendix. 
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V. SUMMARY OF CLAIMED SUBJECT MATTER 

The invention claimed in the present application is related to an isolated polypeptide 
comprising the amino acid sequence of the polypeptide of SEQ ID NO: 18, referred to in the 
present application as "PR0232". The PR0232 gene was shown for the first time in the present 
application to be significantly amplified in human lung and colon cancers as compared to 
normal, non-cancerous human tissue controls (Example 92). This feature is specifically recited 
in Claims 44(a) and 45, and carried by all claims dependent from Claim 44. In addition, the 
invention also claims the amino acid sequence of the polypeptide of SEQ ID NO: 1 8, lacking its 
associated signal-peptide (Claims 44(b) and 46); or the amino acid sequence of the polypeptide 
encoded by the full-length coding sequence of the cDNA deposited under ATCC accession 
number 209250 (Claims 44(c) and 49). The invention is further directed to a chimeric 
polypeptide comprising one of the above polypeptides fused to a heterologous polypeptide 
(Claim 50), and to a chimeric polypeptide wherein the heterologous polypeptide is an epitope tag 
or an Fc region of an immunoglobulin (Claim 51). The preparation of chimeric PRO 
polypeptides (Claims 50 and 51), including those wherein the heterologous polypeptide is an 
epitope tag or an Fc region of an immunoglobulin, is set forth in the specification at page 74, line 
23 to page 75, line 5. Examples 53-56, pages 192-199, describe the expression of PRO 
polypeptides in various host cells, including E. coli, mammalian cells, yeast and Baculovirus- 
infected insect cells. 

The amino acid sequence of the "PR0232" polypeptide and the nucleic acid sequence 
encoding this polypeptide (referred to in the present application as "DNA34435-1 140") are 
shown in the present specification as SEQ ID NOs: 18 and 17, respectively, and in Figures 9 and 
8, described on page 59, lines 4-7. The full-length PR0232 polypeptide having the amino acid 
sequence of SEQ ID NO: 18 is described in the specification at, for example, on page 4, 
pages 3-4 and page 100, page 131, lines 9 to 16 and the isolation of cDNA clones encoding 
PR0232 of SEQ IDNO:18 is described in Example 4, pages 149-150 of the specification. The 
specification discloses that the PR0232 polypeptide possess significant sequence homology to 
cell surface stem cell antigen (35% sequence identity with a stem cell surface antigen from 
Gallus gallus) and may play a role in cell proliferation and/or differentiation, (see for example, 
page 4 and Example 4, lines 14-15). 
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Finally, Example 92, in the specification at page 222, line 26, to page 235, line 3, sets 
forth a 'Gene Amplification assay' which shows that the PR0232 gene is amplified in the 
genome of certain human lung cancers (see Table 9, pages 230-234). The profiles of various 
primary lung and colon tumors used for screening the PRO polypeptide compounds of the 
invention in the gene amplification assay are summarized on Table 8, page 227 of the 
specification. 

VI. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

1 . Whether instant Claims 44-46 and 49-5 1 satisfy the utility /enablement 
requirement under 35 U.S. C. §§101/1 12, first paragraph. 

2. Whether Claims 44-46 and 49-51 are entitled to the priority date of U.S. 
Provisional Patent Application Serial No. 60/059121, filed September 17, 1997. 

3. Whether Claims 44-46 and 49-51 are anticipated under 35 U.S.C. § 102(b) by 
Rosenthal etaL, DE19818619-A1 (October 1999). 

VIL ARGUMENTS 
Summary of the Arguments: 
Issue 1: Utility/ Enablement 

Appellants rely upon the gene amplification data of the PR0232 gene for patentable 
utility of the PR0232 polypeptides. This data is clearly disclosed in the instant specification in 
Example 92 which discloses that the gene encoding PR0232 showed significant amplification, 
ranging from 2.056-fold to 5.28-fold, in five lung tumors or 2.00-fold to 5.32-fold in seven colon 
tumors . Example 92 in the instant specification discloses that, "(a)mplification is associated with 
overexpression of the gene product, indicating that the polypeptides are useful targets for 
therapeutic intervention in certain cancers such as colon, lung, breast and other cancers and 
diagnostic determination of the presence of those cancers" (emphasis added). 

Appellants have submitted ample evidence to show that, in general, if a gene is amplified 
in cancer, it is more likely than not that the encoded protein will be expressed at an elevated 
level. First, Appellants have submitted, in their Response filed July 25, 2005, a Declaration by 
Dr. Audrey Goddard, which explains that a gene identified as being amplified at least 2-fold by 
the disclosed gene amplification assay in a tumor sample relative to a normal sample is useful as 
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a marker for the diaRnosis of cancer , and for monitoring cancer development and/or for 
measuring the efficacy of cancer therapy. Therefore, such a gene is useful as a marker for the 
diagnosis of lung and colon cancers , and for monitoring cancer development and/or for 
measuring the efficacy of cancer therapy. Second, Appellants have submitted, in their Responses 
filed August 9, 2004 and July 27, 2006, ample evidence to show that, in general, if a gene is 
amplified in cancer, it is more likely than not that the encoded protein will be expressed at an 
elevated level. For instance, the articles by Orntoft et al 9 Hyman et al, and Pollack et al 
collectively teach that in general, gene amplification increases mRNA expression . Third, 
Appellants have submitted over a hundred references, along with Declarations of Dr. Paul 
Polakis, which collectively teach that, in general, there is a correlation between mRNA levels 
and polypeptide levels . 

Taken together, although there are some examples in the scientific art that do not fit 
within the central dogma of molecular biology that there is generally a positive correlation 
between DNA, mRNA, and polypeptide levels, in general, in the majority of amplified genes , the 
art overwhelmingly shows that gene amplification influences gene expression at the mRNA and 
protein levels . Therefore, one of skill in the art would reasonably expect in this instance, based 
on the amplification data for the PR0232 gene, that the PR0232 polypeptide is concomitantly 
overexpressed and has utility in the diagnosis of lung and colon cancers. 

Appellants further submit that, as evidenced by the Ashkenazi Declaration and the 
teachings of Hanna and Mornin (both made of record in Appellants' Responses filed December 
10, 2003 and August 9, 2004), simultaneous testing of gene amplification and gene product over- 
expression enables more accurate tumor classification , even if the gene-product, the protein, is 
not over-expressed. This leads to better determination of a suitable therapy for the tumor, as 
demonstrated by a real-world example of the breast cancer marker HER-2/neu. Therefore, as a 
general rule, one skilled in the art would find it more likely than not that PR0232 polypeptides 
are useful as a diagnostic tools for detecting lung and colon tumors. 

As the Examiner no longer questions whether mRNA levels are not predictive of 
polypeptide levels, the evidence presented by Appellants support that gene amplification 
correlates with the increased protein expression. Based on the 2.00-fold to 5.32-fold 
amplification in colon and lung primary tumors, one of ordinary skill would find it credible that 
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the claimed PR0232 polypeptides would have utility as markers for the diagnosis of lung and 
colon tumors , and for monitoring cancer development and/or for measuring the efficacy of 
cancer therapy. 

Accordingly, Appellants submit that when the proper legal standard is applied, one 
should reach the conclusion that the present application discloses at least one patentable utility 
for the claimed PR0232 polypeptides. Further, one of ordinary skill in the art would understand 
how to make and use the recited polypeptides for the diagnosis of lung and colon cancers without 
any undue experimentation. 

Accordingly, this enablement rejection under 35 U.S.C. §§101 and 1 12, first paragraph 
should be withdrawn. 

Issue 2: Priority 

The instant application has not been granted the earlier priority date of U.S. Provisional 
Patent Application Serial No. 60/059121, filed September 17, 1997 on the grounds that the 
60/059121 application fails to provide a utility and lacks an enabling disclosure for the claimed 
invention under 35 U.S.C. §§101/1 12, first paragraph. 

Appellants submit that, for the same reasons discussed above under Issue 1, U.S. 
Provisional Patent Application Serial No. 60/059121 also satisfies the utility requirements. 
Therefore, Appellants should be entitled to the priority date of September 17, 1997. 

Issue 3: Anticipation by Rosenthal et aL 

As discussed above under Issue 2, the present application should be entitled to the earlier 
filing date of September 17, 1997 and therefore, Rosenthal et aL, DE19818619-A1, dated 
October 1999, is not prior art. Thus the instant claims are not anticipated by Rosenthal et aL 

These arguments are all discussed in greater detail below, under their appropriate 
headings. 
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Response to Rejections 

ISSUE 1: The Instant Claims 44-46 and 49-51 Satisfy the Utility Requirement under 

35 U.S.C. §101/§112, First Paragraph, based on the results of the gene amplification assay 

The sole basis for the Examiner's rejection of Claims 44-46 and 49-5 1 under this section 

is that the data presented in the instant Application, allegedly, does not satisfy the requirements 

of 35 U.S.C. §§101/1 12, first paragraph. Appellants strongly disagree for the reasons discussed 

below. 

A. The Legal Standard For Utility Under 35 U.S.C. §101 

According to 35 U.S.C. §101: 

Whoever invents or discovers any new and useful process, machine, manufacture, 
or composition of matter, or any new and useful improvement thereof, may obtain 
a patent therefor, subject to the conditions and requirements of this title. 
(Emphasis added). 

In interpreting the utility requirement, in Brenner v. Manson, 1 the Supreme Court held 
that the quid pro quo contemplated by the U.S. Constitution between the public interest and the 
interest of the inventors required that a patent Applicant disclose a "substantial utility" for his or 

2 

her invention, i.e., a utility "where specific benefit exists in currently available form." The 
Court concluded that "a patent is not a hunting license. It is not a reward for the search, but 
compensation for its successful conclusion. A patent system must be related to the world of 

commerce rather than the realm of philosophy." 3 

4 

Later, in Nelson v. Bowler, the C.C.P.A. acknowledged that tests evidencing 
pharmacological activity of a compound may establish practical utility, even though they may 
not establish a specific therapeutic use. The Court held that "since it is crucial to provide 
researchers with an incentive to disclose pharmaceutical activities in as many compounds as 



' Brenner v. Manson, 383 U.S. 519, 148 U.S.P.Q. (BNA) 689 (1966). 

2 Id. at 534, 148 U.S.P.Q. (BNA) at 695. 

3 Id. at 536, 148 U.S.P.Q. (BNA) at 696. 

4 Nelson v. Bowler, 626 F.2d 853, 206 U.S.P.Q. (BNA) 881 (C.C.P.A. 1980). 
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possible, we conclude adequate proof of any such activity constitutes a showing of practical 
utility." 5 

In Cross v. Iizuka?* the C.A.F.C reaffirmed Nelson, and added that in vitro results might 
be sufficient to support practical utility, explaining that "in vitro testing, in general, is relatively 
less complex, less time consuming, and less expensive than in vivo testing. Moreover, in vitro 
results with the particular pharmacological activity are generally predictive of in vivo test results, 

Le. t there is a reasonable correlation there between." 7 The Court perceived, "No insurmountable 
difficulty" in finding that, under appropriate circumstances, "in vitro testing, may establish a 

8 

practical utility." 

The case law has also clearly established that Appellants' statements of utility are usually 

9 

sufficient, unless such statement of utility is unbelievable on its face. The PTO has the initial 

10 

burden to prove that Appellants' claims of usefulness are not believable on their face. In 
general, an Appellant's assertion of utility creates a presumption of utility that will be sufficient 
to satisfy the utility requirement of 35 U.S.C. §101, "unless there is a reason for one skilled in 

II 12 

the art to question the objective truth of the statement of utility or its scope." ' 



5 Id at 856, 206 U.S.P.Q. (BNA) at 883. 

6 Cross v. lizuka, 753 F.2d 1047, 224 U.S.P.Q. (BNA) 739 (Fed. Cir. 1985). 

7 Id at 1050, 224 U.S.P.Q. (BNA) at 747. 

8 

Id 

In re Gazave, 379 F.2d 973, 154 U.S.P.Q. (BNA) 92 (C.C.P.A. 1967). 

10 

Ibid 

" In re Longer, 503 F.2d 1380,1391, 183 U.S.P.Q. (BNA) 288, 297 (C.C.P.A. 1974). 

12 See also In re Jolles, 628 F.2d 1322, 206 USPQ 885 (C.C.P.A. 1980); In re Irons, 340 F.2d 974, 144 
USPQ351 (\965), In re Sichert, 566 F. 2d 1154, 1159, 196 USPQ 209, 2 12-1 3 (C.C.P.A. 1977). 
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Compliance with 35 U.S.C. §101 is a question of fact. The evidentiary standard to be 
used throughout ex parte examination in setting forth a rejection is a preponderance of the 

14 

totality of the evidence under consideration. Thus, to overcome the presumption of truth that 
an assertion of utility by the Appellant enjoys, the Examiner must establish that it is more likely 
than not that one of ordinary skill in the art would doubt the truth of the statement of utility. 
Only after the Examiner made a proper prima facie showing of lack of utility, does the burden of 
rebuttal shift to the Applicant. The issue will then be decided on the totality of evidence. 

The well established case law is clearly reflected in the Utility Examination Guidelines 

("Utility Guidelines"), which acknowledge that an invention complies with the utility 
requirement of 35 U.S.C. §101, if it has at least one asserted "specific, substantial, and credible 
utility" or a "well-established utility." Under the Utility Guidelines, a utility is "specific" when 
it is particular to the subject matter claimed. For example, it is generally not enough to state that 
a nucleic acid is useful as a diagnostic without also identifying the conditions that are to be 
diagnosed. 

In explaining the "substantial utility" standard, M.P.E.P. §2107.01 cautions, however, 
that Office personnel must be careful not to interpret the phrase "immediate benefit to the 
public" or similar formulations used in certain court decisions to mean that products or services 
based on the claimed invention must be "currently available" to the public in order to satisfy the 
utility requirement, "Rather, any reasonable use that an applicant has identified for the invention 
that can be viewed as providing a public benefit should be accepted as sufficient, at least with 

regard to defining a 'substantial' 'utility.'" 16 Indeed, the Guidelines for Examination of 

17 

Applications for Compliance With the Utility Requirement, gives the following instruction to 

13 Raytheon v. Roper, 724 F.2d 951, 956, 220 U.S.P.Q. (BNA) 592, 596 (Fed. Cir. 1983) cert, denied, 469 
US 835 (1984). 

14 In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d (BNA) 1443, 1444 (Fed. Cir. 1992). 

15 66 Fed. Reg. 1092 (2001). 

16 

M.P.E.P. §2107.01. 
17 M.P.E.P. §2107 11(B)(1). 

-9- 

On Appeal to the Board of Patent Appeals and Interferences 

Appellants' Brief 
Application Serial No. 09/905,348 
Attorney's Docket No. GNE-1618P2C18 



patent examiners: "If the Applicant has asserted that the claimed invention is useful for any 
particular practical purpose . . . and the assertion would be considered credible by a person of 
ordinary skill in the art, do not impose a rejection based on lack of utility." 

B. Proper Application of the Legal Standard 

Appellants submit that the evidentiary standard to be used throughout ex parte 
examination of a patent application is a preponderance of the totality of the evidence under 
consideration. Thus, to overcome the presumption of truth that an assertion of utility by the 
Applicant enjoys, the Examiner must establish that it is more likely than not that one of ordinary 
skill in the art would doubt the truth of the statement of utility. Only after the Examiner has 
made a proper prima facie showing of lack of utility, does the burden of rebuttal shift to the 
Applicant. 

Appellants respectfully submit that the data presented in Example 92 starting on page 222 
of the specification and the cumulative evidence of record support a "specific, substantial and 
credible" asserted utility for the presently claimed invention. 

Patentable utility for the PR0232 polypeptides is based upon the gene amplification data 
for the gene encoding the PR0232 polypeptide. Example 92 describes the results obtained using 
a very well-known and routinely employed polymerase chain reaction (PCR)-based assay, the 
TaqMan IM PCR assay, also referred to herein as the gene amplification assay. This assay allows 
one to quantitatively measure the level of gene amplification in a given sample, say, a tumor 
extract, or a cell line. It was well known in the art at the time the invention was made that gene 
amplification is an essential mechanism for oncogene activation. Appellants isolated genomic 
DNA from a variety of primary cancers and cancer cell lines that are listed in Table 9 (pages 222 
onwards of the specification), including primary lung and colon cancers of the type and stage 
indicated in Table 8 (page 227). The tumor samples were tested in triplicates with Taqman™ 
primers and with internal controls, beta-actin and GADPH in order to quantitatively compare 
DNA levels between samples (page 229). As a negative control, DNA was isolated from the 
cells of ten normal healthy individuals, which was pooled and used as a control (page 222, 
lines 28-29). The results of TaqMan™ PCR are reported in ACt units, as explained in the 
passage on page 222, lines 37-39. One unit corresponds to one PCR cycle or approximately a 
2-fold amplification, relative to control, two units correspond to 4-fold, 3 units to 8-fold 
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amplification and so on. Using this PCR-based assay, Appellants showed that the ge^e encoding 
for PR0232 was amplified, that is, it showed approximately 1.04-2.40 ACt units for five lung 
tumors and 1.00-2.41 ACt units for seven colon tumors, which corresponds to 2 1 04 -2 2 ' 40 - fold 
amplification in lung or to 2 1 00 -2 2 41 - fold amplification in colon tumors; that is 2.056-fold to 
5.28-fold in five lung tumors or 2.00-fold to 5.32-fold in seven colon tumors , which would be 
considered significant and credible by one skilled in the art. Therefore, the PR0232 gene and 
the PR0232 polypeptide are important diagnostic markers to identify such malignant lung or 
colon cancers. 

The Examiner has asserted that only five out of the nineteen lung cancer samples tested 
positive and only seven out of 17 colon tumor samples tested positive for the claimed 
polypeptide, thus, if a sample was taken from an individual with lung cancer for diagnosis, it is 
allegedly more likely than not that this assay would yield a false negative result, (Page 4 of the 
Office Action mailed December 27, 2007) 

Appellants point out that, as any skilled artisan in the field of oncology would easily 
appreciate, not all tumor markers are generally associated with every tumor, or even, with most 
tumors. In fact, some tumor markers are useful for identifying rare malignancies . That is, the 
association of the tumor marker with a particular type of tumor lesion may be rare, or, the 
occurrence of that particular kind of tumor lesion itself may be rare. In either event, even these 
rare tumor markers, which may not give a positive hit with most common tumors, have great 
value in tumor diagnosis, and consequently, in tumor prognosis . For example, the article by 
Hanna and Mornin (submitted with the Response filed August 9, 2004), discloses that the known 
breast cancer marker HER-2/neu is "amplified and/or overexpressed in 10%-30% of invasive 
breast cancers and in 40%-60% of intraductal breast carcinoma" (page 1, col. 1). The skilled 
artisan would know that such tumor markers are very useful for better classification of tumors. 
Therefore, whether the PR0232 gene is amplified in most of the lung or colon tumors is not 
relevant to its identification as a tumor marker, or its patentable utility. Rather, whether the 
amplification data for PRQ232 is significant is what lends support to its usefulness as a tumor 
marker . 

It was well known in the art at the time of filing of the application that gene 
amplification, which occurs in most solid tumors like lung and colon cancers, is generally 
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associated with poor prognosis. Therefore, the PR0232 gene becomes an important diagnostic 
marker to identify such malignant lung and colon cancers, even if the malignancy associated 
with the PRQ232 molecule is a rare occurrence and the present specification clearly discloses 
enough evidence such that, one skilled in the art would know that PR0232 is useful as a valuable 
diagnostic marker for identifying certain types of lung and colon cancers. 

Appellants further note that the tumors listed in Table 8 are not similar tumors from 
different patients, but various types/classes of lung and/or colon tumors at different stages. 
Accordingly, a positive result from one tumor, where the nucleic acid was amplified, but not 
from other tumors, indicates that the nucleic acid can be used as a marker for diagnosing the 
presence of that kind of tumor in which it was amplified. Amplification of the nucleic acid 
would be indicative of that specific class of lung or colon tumor, whereas absence of 
amplification would be non-conclusive. The skilled artisan would certainly know that such 
tumor markers are useful for better classification of tumors. Therefore, whether the PR0232 
gene is amplified in five lung tumors or in all lung tumors is not relevant to its identification as a 
tumor marker, or its patentable utility. The fact that not all lung tumors tested positive in this 
study does not make the gene amplification data less significant. Rather, the fact that the 
amplification data for PR0232 is considered significant is what lends support to its usefulness as 
a tumor marker. If the goal is to diagnose lung and colon cancers, then contrary to the 
Examiner's assertion, a positive result does indicate the presence of cancer, while a negative 
result is not conclusive, and requires follow up testing. 

The Examiner has further alleged, based on Sen et al. and Hittelman et ah, that the 
observed gene amplification was not corrected for aneuploidy. The Examiner has also asserted it 
is not clear whether PR0232 is amplified in cancerous lung tissue more than in damaged lung 
tissue. (Pages 4-6 of the Office Action mailed December 27, 2007) 

Appellants respectfully disagree and submit that their gene amplification data was not 
due to aneuploidy. Appellants have submitted the Ashkenazi Declaration to show that "detection 
of gene amplification can be used for cancer diagnosis even if the determination includes 
measurement of chromosomal aneuploidy." Regarding Sen et al. and Hittelman et al, 
Appellants agree that while aneuploidy can be a feature of damaged tissue as well, besides 
cancerous or pre-cancerous tissue, and may not invariably lead to cancer, Sen et al. in fact 
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support the Appellants' position that PR0232 is still useful in diagnosing pre-cancerous lesions 
or cancer itself. For instance, the art in lung cancer at the time of filing of the instant application 
clearly described that "epithelial tumors develop through a multistep process driven by. genetic 
instability" in damaged lung lesions which may eventually lead to lung cancer. Many articles 
published around June 23, 1999 (the effective filing date of this application) studied such 
damaged or premalignant lesions and suggested that identification of such pre-cancerous lesions 
were very important in preventive diagnosis and treatment of lung cancer. Based on the well- 
known art, Appellants submit that there is utility in identifying genetic biomarkers in epithelial 
tissues at cancer risk. 

The Examiner has also addressed the pooled blood controls used in the gene 
amplification assay and asserted that the controls were not matched, non-tumor lung samples, but 
rather pooled DNA samples from blood of healthy subjects, citing Pennica et al and Konopka et 
al as supporting references. (Page 10 of the Office Action mailed December 27, 2007) 

Appellants respectfully submit that the Examiner's position is scientifically incorrect 
because the instant application relies on genomic DNA amplification for utility and not cDNA 
expression. Different types of cells from the same organism should have the same set of 
genomic DNA. Thus, it does not matter what kind of cells you use for the control as long as the 
control cells have the entire genome. Accordingly, a "tissue-matched" control is not necessary in 
the gene amplification assay. 

Appellants further point out that Pennica et al. teaches the exact same "pooled normal 
blood controls" as that used in the instant gene amplification assay (for instance, see page 14718, 
column 1 and Figure 5 of Pennica et al.). Further, the references Bieche et al. and Pitti et al., 
submitted as Exhibits F and G with the Goddard Declaration, also used "pooled normal blood 
controls" as control. For instance, in Pitti et al. the authors used the same quantitative TaqMan 
PCR assay and pooled normal blood controls described in the instant specification, to study gene 
amplification in lung and colon cancer of DcR3, a decoy receptor for Fas ligand. Pitti et al. 
analyzed DNA copy number "in genomic DNA from 35 primary lung and colon tumors, relative 
to pooled genomic DNA from peripheral blood leukocytes (PBL) of 10 healthy donors." (Page 
701, col. 1). The authors also analyzed mRNA expression of DcR3 in primary tumor tissue 
sections and found tumor-specific expression, confirming the finding of frequent amplification in 
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tumors, and confirming that the pooled blood sample was a valid negative control for the gene 
amplification experiments. In Bieche et al., the authors used the quantitative TaqMan PCR assay 
to study gene amplification of myc, ccndl and erbB2 in breast tumors. As their negative control, 
Bieche et al. used normal leukocyte DNA derived from a small subset of the breast cancer 
patients (page 663). The authors note that "[t]he results of this study are consistent with those 
reported in the literature" (page 664, col. 2). Thus, contrary to the Examiner's allegations, 
Pennica et al, Pitti et al. and Bieche et al. in fact, confirm the validity of use of the "pooled blood 
control" as a negative controls, and indicate that this control was widely utilized in the art at the 
time of filing of the instant application. 

C. A prima facie case of lack of utility has not been established 

The Examiner has asserted, based on Pennica et al , Konopka et al , Godbout et al , and 
Li et al that there is a general lack of correlation between gene amplification and mRNA 
expression, and, thus, the data in Table 9 does not provide a basis for utility or enablement of the 
claimed polypeptides (Pages 4-7 of the Final Office Action mailed December 27, 2007). 

As a preliminary matter, Appellants respectfully submit that it is not a legal requirement 
to establish that gene amplification "necessarily" results in increased expression at the mRNA 
and polypeptide levels or that polypeptide levels can be "accurately predicted." As discussed 
above, the evidentiary standard to be used throughout ex parte examination of a patent 
application is a preponderance of the totality of the evidence under consideration. Accordingly, 
Appellants submit that in order to overcome the presumption of truth that an assertion of utility 
by the applicant enjoys, the Examiner must establish that it is more likely than not that one of 
ordinary skill in the art would doubt the truth of the statement of utility. Therefore, it is not 
legally required that there be a "necessary" correlation between the data presented and the 
claimed subject matter. The law requires pnly that one skilled in the art should accept that such a 
correlation is more likely than not to exist . Appellants respectfully submit that when the proper 
evidentiary standard is applied, a correlation must be acknowledged. Only after the Examiner 
has made a proper prima facie showing of lack of utility, does the burden of rebuttal shift to the 
applicant. The Examiner failed to meet this evidentiary burden. 
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Pennica et al. 

Appellants submit that Pennica et al. does not show a lack of correlation between gene 
(DNA) amplification and mRNA levels. According to the quoted statement from Pennica et al., 
"WISP-l gene amplification in human lung tumors showed a correlation between DNA 
amplification and over-expression, whereas overexpression of WISP-3 RNA was seen in the 
absence of DNA amplification. In contrast, WISP-2 DNA was amplified in lung tumors, but its 
mRNA expression was significantly reduced in the majority of tumors compared with expression 
in normal colonic mucosa from the same patient." From this, the Examiner correctly concludes 
that increased copy number does not necessarily result in increased polypeptide expression. The 
standard, however, is not absolute certainty. The fact that in the case of a specific class of 
closely related molecules there seemed to be no correlation with gene amplification and the level 
of mRNA/protein expression, does not establish that it is more likely than not, in general, that 
such correlation does not exist. The Examiner has not shown whether the lack or correlation 
observed for the family of WISP polypeptides is typical, or is merely a discrepancy, an exception 
to the rule of correlation . Indeed, the working hypothesis among those skilled in the art is that, if 
a gene is amplified in cancer, the encoded protein is likely to be expressed at an elevated level. 
In fact, as noted even in Pennica et al, "[a]n analysis of WISP-l gene amplification and 
expression in human lung tumors showed a correlation between DNA amplification and over- 
expression . . . ." (Pennica et al. 9 pagel4722, left column, first full paragraph, emphasis added). 

Accordingly, Appellants respectfully submit that Pennica et al. teaches nothing 
conclusive regarding the absence of correlation between amplification of a gene and over- 
expression of the encoded WISP polypeptide. More importantly, the teaching of Pennica et al. is 
specific to WISP genes. Pennica et al. has no teaching whatsoever about the correlation of gene 
amplification and protein expression in general . 

Konopka et al. 

Regarding Konopka et al, Applicants submit that the Examiner has completely 
misinterpreted the teachings in the cited reference. Contrary to the Examiner's assertions, 
Konopka et al. does not support the position that DNA amplification is not correlated with 
mRNA overexpression . Konopka et al. show only that, of the cell lines known to have increased 
abl protein expression, only one had amplification of the abl gene (page 4051, col. 1). This 
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result proves only that increased mRNA and protein expression levels can result from causes 
other than gene amplification. Konopka et al. do not demonstrate that when gene amplification 
does occur, it does not result in increased mRNA and protein expression levels, particularly 
given that the cell line with amplification of the abl gene did show increased abl mRNA and 
protein expression levels. Furthermore, Konopka et al. supports Appellants' position that mRNA 
levels correlate with protein levels. Konopka et al state that "the 8-kb mRNA that encodes 
P210 c abl was detected at a 10-fold higher level in SK-CML7bt-333 ( Fig. 3 A, +) than in SK- 
CML16BM (B, +), which correlated with the relative level of P210 c ' abl detected in each cell 
line. Analysis of additional cell lines demonstrated that the level of 8-kb mRNA directly 
correlated with the level of P210 c " abl (Table 1)" (page 4050, col. 2, emphasis added). 

Godbout et al. 

Regarding Godbout, the Examiner has asserted that Godbout et al. teaches that "a number 
of studies suggest that co-amplified genes are only overexpressed if they provide a selective 
advantage to the cells in which they are amplified." The Examiner further asserts that Godbout 
teaches "[i]t is generally accepted that co-amplified genes are not over-expressed unless they 
provide a selective growth advantage to the cell." (Pages 6-7 of the Office Action mailed 
December 27, 2007). 

Appellants have previously made of record three more recent references, published in 
2002, by Orntoft et al, Hyman et al, and Pollack et al, (made of record in Applicants' Response 
filed on August 9, 2004), which collectively teach that in general, gene amplification increases 
mRNA expression . Appellants submit that these more recent references must be acknowledged 
as more accurately reflecting the state of the art regarding the correlation between gene 
amplification and transcript expression than the references cited by Godbout et al. 

Applicants further maintain that Godbout et al report that "there is a good correlation 
with DDX1 gene copy number, DDX1 transcript levels, and DDX1 protein levels in all cell lines 
studied." Thus, in these cancer cell lines, DDX1 mRNA and protein levels are correlated. 

Moreover, selective advantage to cell survival is not the only mechanism by which genes 
impact cancer. Mechanistic data is not a requirement for the utility requirement. Hence, this 
rejection is improper. Applicants respectfully submit that, as discussed above, Orntoft et al, 
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Hyman et al, and Pollack et al, (of record), collectively teach that gene amplification increases 
mRNA expression for large numbers of genes, which have not been identified as being 
oncogenes or as conferring any selective growth advantage on tumor cells. Thus, the art of 
record clearly shows that there is no requirement that a polypeptide must be a known oncogene 
or a protein otherwise known to be associated with tumor growth, in order for amplification of 
the gene encoding the protein to correlate with increased protein expression. In fact, as 
demonstrated by Orntoft et ai, Hyman et al, and Pollack et al, examination of gene 
amplification is a useful way to identify novel proteins not previously known to be associated 
with cancer. 

The Examiner has asserted that unlike Godbout et al, the instant specification does not 
teach structure/ function analysis and the Examiner questions whether the level of genomic 
amplification of DDX1 gene is comparable to that disclosed by PR0232. (Page 12of the Office . 
Action mailed in December 27, 2007) 

Appellants respectfully submit that it was never claimed that PR0232 is similar in any 
way to the DDX1 gene of Godbout et al, they never claimed PR0232 was an RNA helicase or 
that it confers selective advantage to cell survival; on the other hand, the Godbout reference was 
submitted to show good correlation between protein levels based upon genomic DNA 
amplification, which the Examiner clearly agrees with. Moreover, selective advantage to cell 
survival is not the only mechanism by which genes impact cancer. Structure/function data, 
which the Examiner requests, is not a requirement for the utility requirement. Hence this 
rejection is improper. 

Li et al. 

The Examiner has cited Li et al. as teaching that "68.8% of the genes showing over- 
representation in the genome did not show elevated transcript levels." (Page 8 of the Office 
Action mailed in December 27, 2007) 

Appellants respectfully point out that Li et al. acknowledge that their results differed 

from those obtained by Hyman et al. and Pollack et al. (of record), who found a substantially 

higher level of correlation between gene amplification and increased gene expression. The 

authors note that "[t]his discordance may reflect methodologic differences between studies or 

biological differences between breast cancer and lung adenocarcinoma" (page 2629, col. 1). For 
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instance, as explained in the Supplemental Information accompanying the Li article, genes were 
considered to be amplified if they had a copy number ratio of at least 1 .40 . In the case of 
PRO830, as discussed in previously filed responses and in the Goddard Declaration (of record), 
an appropriate threshold for considering gene amplification to be significant is a copy number of 
at least 2.0 (which is a higher threshold). The PR0232 gene showed significant amplification of 
2.0-fold to 5.2-fold in twelve different lung and colon primary tumors , and thus fully meets this 
standard. It is not surprising that in the Li et al reference, by using a lower threshold of 1 .4 for 
considering gene amplification, a higher number of genes not showing corresponding increases 
in mRNA expression were found. Moreover, Appellants add that the results of Li et al. do not 
conclusively disprove that a gene with a substantially higher level of Rene amplification, such as 
PRQ232 , would be expected to show a corresponding increase in transcript expression. 

In response to Applicants' argument that the discordance may reflect methodologic 
differences, the Examiner asserts that "Li et al. did not limit their studies to genes that were 
amplified at less than 2-fold." In support of this assertion, the Examiner cites the first paragraph 
of the Supplemental Material. (Page 13 of the Office Action mailed in December 27, 2007) 

Applicants respectfully point out that the Examiner has misinterpreted the methodology 
disclosed in the supplemental material. The evidence cited by the Examiner pertains to the 
inclusion criteria of the probes used for defining amplicons. In the second paragraph entitled 
"Relationship between genomic copy number and gene transcript level", the authors state that 
"[f]or each gene, the CGH data were represented by a vector that was labeled 4 1' for genomic 
overrepresentation (including amplification) ratio greater than 1.40 and '0' for no genomic 
overrepresentation." Nevertheless, the Examiner acknowledges that the alleged 2-fold 
amplification criteria would only apply to some of the samples. The Examiner has not 
established that a correlation does not exist in samples based solely on this threshold. 

In summary, the Patent Office has failed to meet its initial burden of proof that 
Applicants' claims of utility are not substantial or credible. The arguments presented by the 
Examiner in combination with the cited articles do not provide sufficient reasons to doubt the 
statements by Applicants that PR0232 has utility. As discussed above, the law does not require 
that DNA amplification is "always" associated with overexpression of the gene product. 
Therefore, Applicants submit that the Examiner's reasoning is based on a misrepresentation of 
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the scientific data presented in the above cited reference and application of an improper, 
heightened legal standard. In fact, contrary to what the Examiner contends, the art indicates that, 
if a gene is amplified in cancer, it is more likely than not that the encoded protein will be 
expressed at an elevated level. 

It is "more likely than not" for amplified genes to have increased mRNA 

On the contrary, Appellants submit that Example 92 of the specification further discloses 
that, "(amplification is associated with overexpression of the gene product, indicating that the 
polypeptides are useful targets for therapeutic intervention in certain cancers such as lung, colon, 
breast and other cancers and diagnostic determination of the presence of those cancers" 
(Emphasis added). Besides, Appellants have submitted ample evidence to show that, in general, 
if a gene is amplified in cancer, it is "more likely than not" that the corresponding mRNA will 
also be expressed at an elevated level. 

For instance, Appellants presented the articles by Orntoft et al, Hyman et al, and 
Pollack et al (made of record in Appellants' Response filed August 9, 2004), who collectively 
teach that in general, for most Renes, DNA amplification increases mRNA expression . The 
results presented by Orntoft et al, Hyman et al, and Pollack et al are based upon wide ranging 
analyses of a large number of tumor associated genes. Orntoft et al. studied transcript levels of 
5600 genes in malignant bladder cancers, many of which were linked to the gain or loss of 
chromosomal material, and found that in general (18 of 23 cases) chromosomal areas with more 
than 2-fold gain of DNA showed a corresponding increase in mRNA transcripts. Hyman et al 
compared DNA copy numbers and mRNA expression of over 12,000 genes in breast cancer 
tumors and cell lines, and found that there was evidence of a prominent global influence of copy 
number changes on gene expression levels. In Pollack et al, the authors profiled DNA copy 
number alteration across 6,691 mapped human genes in 44 predominantly advanced primary 
breast tumors and 10 breast cancer cell lines, and found that on average, a 2-fold change in DNA 
copy number was associated with a corresponding 1.5-fold change in mRNA levels. In 
summary, the evidence supports the Appellants' position that gene amplification is more likely 
than not predictive of increased mRNA and polypeptide levels. 

Thus, the Examiner appears to disregard the ample evidence provided in the above 

referenced articles based on misinterpretations of their teachings. Appellants submit that in fact, 
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these articles lend significant support that for an amplified gene, it is more likely than not that the 
protein will also be overexpressed and would be viewed as reasonable and credible by one of 
ordinary skill in the art. The "more likely than not" standard is a much lower standard than a 
"necessary" correlation or "accurate" prediction, and is clearly met in the claimed invention. 
Moreover, the Examiner has not cited any evidence or advanced any arguments as to why 
Appellants' statement of overexpression of protein would not be credible. Accordingly, this 
point is believed to be moot. 



Even if a prima facie case of lack of utility has been established, it should be 
withdrawn on consideration of the totality of evidence 

Even if one assumes arguendo that it is more likely than not that there is no correlation 

between gene amplification and increased mRNA/protein expression, which Appellants submit is 

not true, a polypeptide encoded by a gene that is amplified in cancer would still have a specific, 

substantial, and credible utility. In support, Appellants respectfully draw the Board's attention to 

page 2 of the Declaration of Dr. Avi Ashkenazi (submitted with the Response filed December 

10, 2003) which explains that, 

even when amplification of a cancer marker gene does not result in significant 
over-expression of the corresponding gene product, this very absence of gene 
product over-expression still provides significant information for cancer diagnosis 
and treatment. Thus, if over-expression of the gene product does not parallel gene 
amplification in certain tumor types but does so in others, then parallel monitoring 
of gene amplification and gene product over-expression enables more accurate 
tumor classification and hence better determination of suitable therapy. In 
addition, absence of over-expression is crucial information for the practicing 
clinician. If a gene is amplified but the corresponding gene product is not over- 
expressed, the clinician accordingly will decide not to treat a patient with agents 
that target that gene product. 

Appellants thus submit that simultaneous testing of gene amplification and gene product 
over-expression enables more accurate tumor classification, even if the gene-product, the protein, 
is not over-expressed. This leads to better determination of a suitable therapy. Further, as 
explained in Dr. Ashkenazi's Declaration, absence of over-expression of the protein itself is 
crucial information for the practicing clinician. If a gene is amplified in a tumor, but the 
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corresponding gene product is not over-expressed, the clinician will decide not to treat a patient 
with agents that target that gene product. This not only saves money, but also has the benefit that 
the patient can avoid exposure to the side effects associated with such agents. 

This utility is further supported by the teachings of the article by Hanna and Mornin. 
(Pathology Associates Medical Laboratories, August (1999), submitted with the Response filed 
August 9, 2004). The article teaches that the HER-2/neu gene has been shown to be amplified 
and/or over-expressed in 10%-30% of invasive breast cancers and in 40%-60% of intraductal 
breast carcinomas. Further, the article teaches that diagnosis of breast cancer includes testing 
both the amplification of the HER-2/neu gene (by FISH) as well as the over-expression of the 
HER-2/neu gene product (by IHC). Even when the protein is not over-expressed, the assay 
relying on both tests leads to a more accurate classification of the cancer and a more effective 
treatment of it. 

Appellants have clearly shown that the gene encoding the PR0232 polypeptide is 
amplified in at least five lung tumors and seven colon tumors. Therefore, the PR0232 gene, 
similar to the HER-2/neu gene disclosed in Hanna et ai, is a tumor associated gene. 
Furthermore, as discussed above, in the majority of amplified genes, the teachings in the art 
overwhelmingly show that gene amplification influences gene expression at the mRNA and 
protein levels. Therefore, one of skill in the art would reasonably expect in this instance, based 
on the amplification data for the PR0232 gene, that the PR0232 polypeptide is concomitantly 
overexpressed. 

Thus, based on the asserted utility for PR0232 in the diagnosis of selected lung or colon 
tumors, the reduction to practice of the instantly claimed protein sequence of SEQ ID NO: 18 in 
the present application (also see pages 3- 4 and Example 4, pages 149-150), the step-by-step 
preparation of chimeric PRO polypeptides, including those wherein the heterologous polypeptide 
is an epitope tag or an Fc region of an immunoglobulin (page 74, line 23 to page 75, line 5), the 
description of the expression of PRO polypeptides in various host cells, including E. coli, 
mammalian cells, yeast and Baculovirus-infected insect cells at least in Examples 53-56, 
pages 192-199, the disclosure of the step-by-step protocol for the preparation, isolation and 
detection of monoclonal, polyclonal and other types of antibodies against the PR0232 protein in 
the specification (monoclonal and polyclonal antibodies at page 139, line 32, to page 141, 
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line 13; humanized antibodies at page 141, line 15, to page 142, line 16; antibody fragments at 
page 143, line 8 onwards; labeled antibodies at pages 144-145, line 16 onwards and page 146, 
line 33 to page 147, line 6) and the disclosure of the gene amplification assay in Example 92, the 
skilled artisan would know exactly how to make and use the claimed polypeptides for the 
diagnosis of lung or colon cancers. Appellants submit that based on the detailed information 
presented in the specification and the advanced state of the art in oncology, the skilled artisan 
would have found such testing routine and not 'undue.' 

Therefore, since the instantly claimed invention is supported by either a credible, specific 
and substantial asserted utility or a well-established utility, one skilled in the art would know 
"how to make and use" the claimed invention without undue experimentation, Appellants 
respectfully request reconsideration and reversal of the determination of priority for 
Claims 44-46 and 49-51. 

ISSUE 2. Claims 44-46 and 49-51 should be entitled to the priority date of U.S. Provisional 
Patent Application Serial No. 60/059121, filed September 17, 1997 

The instant application has not been granted the earlier priority date of U.S. Provisional 
Patent Application Serial No. 60/059121, filed September 17, 1997 on the grounds that the prior 
60/059121 application fails to provide a utility and lacks an enabling disclosure for the claimed 
invention under 35 U.S. C.§§ 101/1 12, first paragraph." 

Appellants disagree and submit that, for the same reasons discussed above under Issue 1 , 
U.S. Provisional Patent Application Serial No. 60/059121 also satisfies the utility requirements. 
Therefore, Appellants should be entitled to the priority date of September 17, 1997. 

ISSUE 3. Claims 44-46 and 49-51 are not anticipated by Rosenthal et g/„ DE19818619-A1 
(dated 10/1999) 

Claims 44-46 and 49-51 remain rejected under 35 U.S.C. § 102(b) as being anticipated by 
Rosenthal etai, DE19818619-A1 (dated 10/1999). 

For the reasons discussed above under Issue 2, Appellants maintain that they are entitled 
to an effective filing date of September 17, 1997 based on a properly claimed priority to 
International application PCT/US98/18824, Therefore, Rosenthal et al. is not prior art and does 
not anticipate the instant claims. Accordingly, this rejection under 35 U.S.C. § 102(b) should be 
withdrawn. 
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CONCLUSION 



For the reasons given above, Appellants submit that present specification and the 
specification of U.S. Provisional Patent Application Serial No. 60/059121 dated 
September 17, 1997 clearly describes and provides at least one patentable utility for the instantly 
claimed invention. Moreover, it is respectfully submitted that the present specification clearly 
teaches "how to use" the presently claimed polypeptide based upon this disclosed patentable 
utility. Accordingly, Rosenthal et aL, DE19818619-A1 is not prior art. As such, Appellants 
respectfully request reconsideration and reversal of the outstanding rejection of Claims 44-46 
and 49-51. 

The Commissioner is authorized to charge any fees which may be required, including 
extension fees, or credit any overpayment to Deposit Account No. 50-4634 (referencing 
Attorney' s Docket No. 123851-181890 (GNE-1618P2C18 ). 



GOODWIN PROCTER LLP 

135 Commonwealth Drive 
Menlo Park, California 94025 
Telephone: (650) 752-3100 
Facsimile: (650) 853-1038 



Respectfully submitted, 



Date: September 29, 2008 
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VIII. CLAIMS APPENDIX 

Claims on Appeal 

44. An isolated polypeptide comprising: 

(a) the amino acid sequence of the polypeptide of.SEQ ID NO: 18; 

(b) the amino acid sequence of the polypeptide of SEQ ID NO: 18, lacking its 
associated signal peptide; or 

(c) the amino acid sequence of the polypeptide encoded by the full-length coding 
sequence of the cDNA deposited under ATCC accession number 209250; 

wherein, the nucleic acid encoding said polypeptide is amplified in lung or colon tumors. 

45. The isolated polypeptide of Claim 44 comprising the amino acid sequence of the 
polypeptide of SEQ ID NO: 1 8. 

46. The isolated polypeptide of Claim 44 comprising the amino acid sequence of the 
polypeptide of SEQ ID NO: 18, lacking its associated signal peptide. 

49. The isolated polypeptide of Claim 44 comprising the amino acid sequence of the 
polypeptide encoded by the full-length coding sequence of the cDNA deposited under ATCC 
accession number 209250. 

50. A chimeric polypeptide comprising a polypeptide according to Claim 44 fused to 
a heterologous polypeptide. 

5 1 . The chimeric polypeptide of Claim 50, wherein said heterologous polypeptide is 
an epitope tag or an Fc region of an immunoglobulin. 
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IX. EVIDENCE APPENDIX 

1. Declaration of Avi Ashkenazi, Ph.D. under 35 C.F.R. §1.132, with attached Exhibit A 
(Curriculum Vitae). 

2. Declaration of Audrey D. Goddard, Ph.D. under 37 C.F.R. §1.132, with attached Exhibits 
A-G: 

A. Curriculum Vitae of Audrey D. Goddard, Ph.D. 

. B. Higuchi; R. et aL, "Simultaneous amplification and detection of specific 
DNA sequences," Biotechnology 10:413-417 (1992). 

C. Livak, K.J., et aL, "Oligonucleotides with fluorescent dyes at opposite 
ends provide a quenched probe system useful for detecting PCR product 
and nucleic acid hybridization," PCR Methods Appl. 4:357-362 (1995). 

D. Heid, C.A. et aL, "Real time quantitative PCR," Genome Res. 6:986-994 
(1996). 

E. Pennica, D. et aL, "WISP genes are members of the connective tissue 
growth factor family that are up-regulated in Wnt- 1 -transformed cells and 
aberrantly expressed in human colon tumors," Proc. Natl. Acad Sci. USA 
95:14717-14722 (1998). 

F. Pitti, R.M. et aL, "Genomic amplification of a decoy receptor for Fas 
ligand in lung and colon cancer," Nature 396:699-703 (1998). 

G. Bieche, I. et aL, "Novel approach to quantitative polymerase chain 
reaction using real-time detection: Application to the detection of gene 
amplification in breast cancer," Int. J. Cancer 78:661-666 (1998). 

3. Declaration of Paul Polakis, Ph.D. under 37 C.F.R. §1.132. 

4. Hyman, E., et aL, "Impact of DNA Amplification on Gene Expression Patterns in Breast 
Cancer," Cancer Research 62:6240-6245 (2002). 

5. Pollack, J.R., et aL, "Microarray Analysis Reveals a Major Direct Role of DNA Copy 
Number Alteration in the Transcriptional Program of Human Breast Tumors," Proc. Natl. 
Acad. Sci. USA 99:12963-12968 (2002). 

6. Orntoft, T.F., et aL Molecular & Cellular Proteomics - 1 :37-45 (2002). 

7. Hanna et aL, "HER-2/neu Breast Cancer Predictive Testing," Pathology Associates 
Medical Laboratories (1999). 

8. Declaration of Paul Polakis, Ph.D. under 35 C.F.R. §1.132 (Polakis II). 

9. Godbout, R., et aL, J Biol Chem, - 273(33):21 161-8 (1998). 

10. Rosenthal et aL, DE19818619-A1 (dated October 28, 1999). 
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1 1 . Sen, S., "Aneuploidy and cancer," Curr. Opin Oncol 12:82-88 (2000). 



12. Li et al. 9 2006, Oncogene, 25: 2628-2635. 

13. Konopka et al , "Variable Expression of the Translocated c-abl oncogene in Philadelphia- 
chromosome-positive B-lymphoid cell lines from chronic myelogenous leukemia 
patients" Proc, Natl Acad Set USA 83: 4049-52, (1986). 

14. Hittelman, 2001, Ann. N. Y. Acad. Sci. 952:1-12. 

Item 1 was submitted with Appellants' Response filed December 10, 2003, and was considered 
by the Examiner as indicated in the Office action mailed April 8, 2004. 

Item 2 was submitted with Appellants' Response filed July 25, 2005, and were considered by the 
Examiner as indicated in the second Final Office action mailed October 21, 2005. 

Items 3-7 were submitted with Appellants' Response filed August 9, 2004, and were considered 
by the Examiner as indicated in the Office action mailed December 6, 2004. 

Items 8 and were submitted with Appellants' Response filed July 27, 2006, and was considered 
by the Examiner as indicated in the Final Office Action mailed October 17, 2006. 

Item 9 was submitted with Appellants' Appeal Brief filed September 24, 2007 and made of 
records by the Examiner in the Final Office Action mailed December 27, 2007. 

Item 1 (E), i.e. : Pennica et al, and Items 10-11 were made of record by the Examiner in the 
Office Action mailed September 29, 2003. 

Item 12 was made of record by the Examiner in the Final Office Action mailed October 17, 
2006. 

Items 13 and 14 were made of record by the Examiner in the Final Office Action mailed 
December 27, 2007. 
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X. RELATED PROCEEDINGS APPENDIX 

None- no decision rendered by a Court or the Board in any related proceedings identified 

above. 



LIBC/339271I.I 
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PATENT 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re Application of: Ashkenazi et al. 



Group Art Unit: 1647 



Serial No.: 09/903,925 



Examiner: Fozia Hamid 



Filed: July 11,2001 




For: SECRETED AND 



TRANSMEMBRANE . 
POLYPEPTIDES AND NUCLEIC 
ACIDS 



DECLARATION OF AUDREY D. GODDARD, Ph.D UNDER 37 C.F.R. $ 1.132 

Assistant Co mmi ssioner of Patents 
Washington, D.C. 20231 

Sir. 

1, Audrey D. Goddard, Pn.D. do hereby declare and say as follows: 

1. I am a Senior Clinical Sciennst at IheExpermientalMedicine/BioOncology, Medical 
Affairs Department of Genentech, Inc., South San Francisco, California 94080. 

2. Between 1 993 and 200 1 , 1 headed the DNA Sequencing Laboratory at the Molecular 
Biology Department of Genentech, Inc. During this time, my responsibilities included the 
identification and characterization of genes contributing to the oncogenic process, and deterrnination 
of the chromosomal localization of novel genes. 

3. My scientific Curriculum Vitae, including my list of publications, is attached to and 
forms part of this Declaration (Exhibit A). 



C 



Serial Nol: * 
Filed: * 

4. I am familiar with a variety of techniques known in the art for detecting and 
quantifying the amplification of oncogenes in cancer, including the quantitative TaqManPCR (i.e., . 
"gene amplification") assay described in the above captioned patent application. 

5. The TaqMan PCR assay is described, for example, in the following scientific 
publications: Higuchi et aL, Biotechnology 10:413-417 (1992) (Exhibit B); Livak et a/., PCR 
Methods Add!,, 4:357-362 (1995) (Exhibit C> and Heid et al, Genome Res. 6:986-994.(1996) 
(Exhibit D). Briefly, the assay is based on the principle that successful PCR yields a fluorescent 
signal due to Taq DNA polymerase-mediated exonuclease digestion of a fluorescently labeled 
oligonucleotide that is homologous to a sequence between two PCR primers. The extent of 
digestion depends directly on the amount of PCR, and can be quantified accurately by measuring the 
increment in fluorescence that results from decreased energy transfer. This is an extremely sensitive 
technique, which allows detection in the exponential phase of the PCR reaction and, as a result, 
leads to accurate determination of gene copy number. 

6. The quantitative fluorescent TaqMan PCR assay has been extensively and 
successfully used to characterize genes involved in cancer development and progression. 
Amplification of protoonco genes has been studied in a variety of human tumors, and is widely 
considered as having etiological, diagnostic and prognostic significance. This use of the quantitative 
TaqMan PCR assay is exemplified by the following scientific publications: Pennica et al, Proc. 
Natl. Acad Sci. USA . 95(25): 14717-14722 (1998) (Exhibit E); Pitti et al, Nature 
396(6712);699-703 (1998) (Exhibit F) andBieche et al. Int. J. Cancer 78:661-666 (1998) (Exhibit 
G), the first two of which I am co-author. In particular, Pennica et al have used the quantitative 
TaqMan PCR assay to study relative gene amplification of WISP and c-myc in various cell lines, 
colorectal tumors and normal mucosa. Pitti et al studied the genomic amplification of a decoy 

. receptor for Fas ligand in lung and colon cancer, using the quantitative TaqManPCR assay. Bieche 
et al used the assay to study gene amplification in breast cancer. 
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Serial No.:* 
Filed:* 

7. It is my personal experience that the quantitative TaqMan PCR technique is 
technically sensitive enough to detect at least a 2-fold increase in gene copy number relative to 
control. It is further my considered scientific opinion that an at least 2-fold increase in gene copy 
number in a tumor tissue sample relative to a normal (i.e., non-tumor) sample is significant and 
useful in that the detected increase in gene copy number in the tumor sample relative to the normal 
sample serves as a basis for using relative gene copy number as quantitated by the TaqMan PCR 
technique as a diagnostic marker for the presence or absence of tumor in a tissue sample of unknown - 
pathology. Accordingly, a gene identified as being amplified at least 2-fold by the quantitative 
TaqMan PCR assay in a tumor sample relative to a normal sample is useful as a marker for the 
diagnosis of cancer, for monitoring cancer development and/or for measuring the efficacy of cancer 
therapy. 

8. I" declare further that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true. I declare that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code, and that such willful false statements may jeopardize the validity of the application or any 
patent issuing thereon. 

Date Audrey D. Goddard, Ph.D. 



AUDREY D. GODDARD, Ph.D. 



Genentech, Inc. 110 Congo St. 

1 DNA Way San Francisco, CA, 94131 

South San Francisco, CA, 94080 41 5.841 .91 54 

650.225.6429 41 5.81 9.2247 (mobile) 

goddarda@gene.com agoddard@pacbell.net 



PROFESSIONAL EXPERIENCE 

Genentech, Inc. 1993-present 
South San Francisco, CA 

2001 - present Senior Clinical Scientist 

Experimental Medicine / BioOncology, Medical Affairs 

Responsibilities: 

• Companion diagnostic oncology products 

• Acquisition of clinical samples from Genentech's clinical trials for translational research 

• Translational research using clinical specimen and data for drug development and 
diagnostics 

• Member of Development Science Review Committee, Diagnostic Oversight Team, 21 CFR 
Part 11 Subteam 

Interests: 

• Ethical and legal implications of experiments with clinical specimens and data 

• Application of pharmacogenomics in clinical trials 



1998 - 2001 Senior Scientist 

Head of the DNA Sequencing Laboratory, Molecular Biology Department, Research 
Responsibilities: 

• Management of a laboratory of up to nineteen -including postdoctoral fellow, associate 
scientist, senior research associate and research assistants/associate levels 

• Management of a $750K budget 

• DNA sequencing core facility supporting a 350+ person research facility. 

• DNA sequencing for high throughput gene discovery, - ESTs, cDNAs, and constructs 

• Genomic sequence analysis and gene identification 

• DNA sequence and primary protein analysis 

Research: 

• Chromosomal localization of novel genes 

• Identification and characterization of genes contributing to the oncogenic process 

• Identification and characterization of genes contributing to inflammatory diseases 

• Design and development of schemes for high throughput genomic DNA sequence analysis 

• Candidate gene prediction and evaluation 
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1993-1998 Scientist 

Head of the DNA Sequencing Laboratory, Molecular Biology Department, Research 
Responsibilities 

• DNA sequencing core facility supporting a 350+ person research facility 

• Assumed responsibility for a pre-existing team of five technicians and expanded the group 
into fifteen, introducing a level of middle management and additional areas of research 

• Participated, in the development of the basic plan for high throughput secreted protein 
discovery program - sequencing strategies, data analysis and tracking, database design 

• High throughput EST and cDNA sequencing for new gene identification. 

• Design and implementation of analysis tools required for high throughput gene identification. 

• Chromosomal localization of genes encoding novel secreted proteins. 

Research: 

• Genomic sequence scanning for new gene discovery. 

• Development of signal peptide selection methods. 

• Evaluation of candidate disease genes. 

• Growth hormone receptor gene SNPs in children with Idiopathic short stature 



Imperial Cancer Research Fund 1989-1992 
London, UK with Dr. Ellen Solomon 

6/89 -12/92 Postdoctoral Fellow 

• Cloning and characterization of the genes fused at the acute promyelocyte leukemia 
translocation breakpoints on chromosomes 17 and 15. 

• Prepared a successfully funded European Union multi-center grant application 

McMaster University 1983 
Hamilton, Ontario, Canada with Dr. G. D. Sweeney 

5/83 - 8/83: NSERC Summer Student 

• In vitro metabolism of p-naphthoflavone in C57BI/6J and DBA mice 



EDUCATION 
Ph.D. 

"Phenotypic and genotypic effects of mutations in 
the human retinoblastoma gene." 
Supervisor: Dr. R. A. Phillips 

Honours B.Sc 

"The in wYro metabolism of the cytochrome P-448 
inducer p-naphthoflavone in C57BL/6J mice." 
Supervisor: Dr. G. D. Sweeney 



University of Toronto 

Toronto, Ontario, Canada. 1989 

Department of Medical 

Biophysics. 

McMaster University, 

Hamilton, Ontario, Canada. 1983 

Department of Biochemistry 
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ACADEMIC AWARDS 



Imperial Cancer Research Fund Postdoctoral Fellowship 

Medical Research Council Studentship 

NSERC Undergraduate Summer Research Award 

Society of Chemical Industry Merit Award (Hons. Bioehem.) 

Dr. Harry Lyman Hooker Scholarship 

J.LW. Gill Scholarship 

Business and Professional Women's Club Scholarship 
Wyerhauser Foundation Scholarship 



1989-1992 
1983-1988 
1983 



1983 



1981-1983 
1981-1982 
1980-1981 
1979-1980 



INVITED PRESENTATIONS 

Genentech's gene discovery pipeline: High throughput identification, cloning and 
characterization of novel genes. Functional Genomics: From Genome to Function, Litchfield 
Park, AZ, USA. October 2000 

High throughput identification, cloning and characterization of novel genes. G2K:Back to 
Science, Advances in Genome Biology and Technology I. Marco Island, FL, USA. February 



Quality control in DNA Sequencing: The use of Phred and Phrap. Bay Area Sequencing 
Users Meeting, Berkeley, CA, USA. April 1999 

High throughput secreted protein identification and cloning. Tenth International Genome 
Sequencing and Analysis Conference, Miami, FL, USA. September 1998 

The evolution of DNA sequencing: The Genentech perspective. Bay Area Sequencing Users 
Meeting, Berkeley, CA, USA. May 1998 

Partial Growth Hormone Insensitivity: The role of GH-receptor mutations in Idiopathic Short 
Stature. Tenth Annual National Cooperative Growth Study Investigators Meeting, San 
Francisco, CA, USA. October, 1996 

Growth hormone (GH) receptor defects are present in selected children with non-GH-deficient 
short stature: A molecular basis for partial GH-insensitivity. 76 th Annual Meeting of The 
Endocrine Society, Anaheim, CA, USA. June 1994 

A previously uncharacterized gene, myl, is fused to the retinoic acid receptor alpha gene in 
acute promyelocyte leukemia. XV International Association for Comparative Research on 
Leukemia and Related Disease, Padua, Italy. October 1991 



2000 
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PATENTS 

Goddard A, Godowski P J, Gurney AL. NL2 Tie ligand homologue polypeptide. Patent 
Number: 6,455,496. Date of Patent: Sept. 24, 2002. 

Goddard A, Godowski PJ and Gurney AL NL3 Tie ligand homologue nucleic acids. Patent 
Number: 6,426,218. Date of Patent: July 30, 2002. 

Godowski P, Gurney A, Hillan KJ, Botstein D f Goddard A, Roy M, Ferrara N, Tumas D, 
Schwall R. NL4 Tie ligand homologue nucleic acid. Patent Number: 6,4137,770. Date of 
Patent: July 2, 2002. 

Ashkenazi A, Fong S, Goddard A, Gurney AL, Napier MA, Tumas D, Wood Wl. Nucleic acid 
encoding A-33 related antigen poly peptides. Patent Number: 6,410,708. Date of Patent" 
Jun. 25, 2002. 

Botstein DA, Cohen RL, Goddard AD, Gurney AL, Hillan KJ, Lawrence DA, Levine AJ, 
Pennica D, Roy MA and Wood Wl. WISP polypeptides and nucleic acids encoding same. 
Patent Number: 6,387,657. Date of Patent: May 14, 2002. 

Goddard A, Godowski PJ and Gurney AL. Tie ligands. Patent Number: 6,372,491. Date of 
Patent: April 16, 2002. 

Godowski PJ, Gurney AL, Goddard A and Hillan K. TIE ligand homologue antibody. Patent 
Number: 6,350,450. Date of Patent: Feb. 26, 2002. 

Fong S, Ferrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM, Tie 
receptor tyrosine kinase ligand homologues. Patent Number: 6,348,351. Date of Patent: 
Feb. 19, 2002. 

Goddard A, Godowski PJ and Gurney AL. Ligand homologues. Patent Number: 6,348,350. 
Date of Patent: Feb. 1 9, 2002. 

Attie KM, Carlsson LMS, Gesundheit H and Goddard A. Treatment of partial growth 
hormone insensitivity syndrome. Patent Number: 6,207,640. Date of Patent: March 27 
2001. 

Fong S, Ferrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM. Nucleic 
acids encoding NL-3. Patent Number: 6,074,873. Date of Patent: June 13, 2000 

Attie K, Carlsson LMS, Gesunheit N and Goddard A.. Treatment of partial growth hormone 
insensitivity syndrome. Patent Number: 5,824,642. Date of Patent: October 20, 1998 

Attie K, Carlsson LMS, Gesunheit N and Goddard A. Treatment of partial growth hormone 
Insensitivity syndrome. Patent Number: 5,646,1 13. Date of Patent: July 8, 1997 

Multiple additional provisional applications filed 
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PUBLICATIONS 

Seshasayee D, Dowd P, Gu Q, Erickson S, Goddard AD Comparative sequence analysis of 
the HER2 locus in mouse and man. Manuscript in preparation. 

Abuzzahab MJ f Goddard A, Grigorescu F, Lautier C, Smith RJ and Chernausek SD. Human 
IGF-1 receptor mutations resulting in pre- and post-natal growth retardation. Manuscript in 
preparation. 

Aggarwal S, Xie, M-H, Foster J, Frantz G, Stinson J, Corpuz RT, Simmons L t Hillan K, 
Yansura DG, Vandlen RL, Goddard AD and Gurney AL. FHFR, a novel receptor for the 
fibroblast growth factors. Manuscript.submitted. 

Adams SH, Chui C, Schilbach SL, Yu XX, Goddard AD, Grimaldi JC, Lee J, Dowd P, Colman 
S. t Lewin DA. (2001) BFIT, a unique acyl-CoA thioesterase induced in thermogenic brown 
adipose tissue: Cloning, organization of the human gene, and assessment of a potential link 
to obesity. Biochemical Journal 360: 1 35-1 42. 

Lee J. Ho WH. Maruoka M, Corpuz RT. Baldwin DT. Foster JS. Goddard AD. Yansura DG. 
Vandlen RL. Wood Wl. Gurney AL. (2001) IL-17E, a novel proinflammatory ligand for the IL- 
17 receptor homolog IL-17RM. Journal } of Biological Chemistry 276(2): 1660-1664. 

Xie M-H, Aggarwal S, Ho W-H, Foster J, Zhang Z, Stinson J, Wood Wl, Goddard AD and 
Gurney AL. (2000) Interleukin (IL)-22, a novel human cytokine that signals through the 
interferon-receptor related proteins CRF2-4 and IL-22R. Journal of Biological Chemistry 275: 
31335-31339. 

Weiss GA, Watanabe CK, Zhong A, Goddard A and Sidhu SS. (2000) Rapid mapping of 
protein functional epitopes by combinatorial alanine scanning. Proc. Natl. Acad. Sci. USA 97: 
8950-8954. 

Guo S, Yamaguchi Y, Schilbach S, Wada T.;Lee J, Goddard A, French D , Handa H, 
Rosenthal A. (2000) A regulator of transcriptional elongation controls vertebrate neuronal . 
development. Nature 408: 366-369. 

Yan M, Wang L-C, Hymowitz SG, Schilbach S, Lee J, Goddard A, de Vos AM, Gao WQ, Dixit 
VM. (2000) Two-amino acid molecular switch in an epithelial morphogen that regulates 
binding to two distinct receptors. Science 290: 523-527. 

Sehl PD, Tai JTN, Hillan KJ, Brown LA, Goddard A, Yang R, Jin H and Lowe DG. (2000) 
Application of cDNA microarrays in determining molecular phenotype in cardiac growth, 
development, and response to injury. Circulation 101: 1990-1999. 

Guo S, Brush J, Teraoka H, Goddard A, Wilson SW, Mullins MC and Rosenthal A. (1999) 
Development of noradrenergic neurons in the zebrafish hindbrain requires BMP, FGF8, and 
the homeodomain protein soulless/Phox2A. Neuron 24: 555-566. 

Stone D, Murone, M, Luoh, S, Ye W ( Armanini P, Gurney A, Phillips HS, Brush, J, Goddard 
A, de Sauvage FJ and Rosenthal A. (1999) Characterization of the human suppressor of 
fused; a negative regulator of the zinc-finger transcription factor Gli. J. Cell Sci. 112: 4437- 
4448. 

Xie M-H, Holcomb I, Deuel B, Dowd P, Huang A, Vagts A, Foster J, Liang J, Brush J, Gu Q, 
Hillan K, Goddard A and Gurney, A.L. (1999) FGF-19, a novel fibroblast growth factor with 
unique specificity for FGFR4. Cytokine 1 1 : 729-735. 
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Yan M.Lee J, Schilbach S, Goddard A and Dixit V. (1999) mE10, a novel caspase 
recruitment domain-containing proapoptotic molecule. J. Biol. Chem. 274(15): 10287-10292. 

Gurney AL, Marsters SA, Huang RM, Pitti RM, Mark DT, Baldwin DT, Gray AM, Dowd P, 
Brush J, Heldens S, Schow P, Goddard AD, Wood.WI, Baker KP, Godowski PJ and 
Ashkenazi A. (1999) Identification of a new member of the tumor necrosis factor family and its 
receptor, a human ortholog of mouse GITR. Current Biology 9(4): 215-218. 

Ridgway JBB, Ng E, Kern JA ,Lee J, Brush J, Goddard A and Carter P. (1999) Identification 
of a human anti-CD55 single-chain Fv by subtractive panning of a phage library using tumor 
and nontumor cell lines. Cancer Research 59: 2718-2723. 

Pitti RM, Marsters SA, Lawrence DA, Roy M, Kischkel FC, Dowd P, Huang A, Donahue CJ, 
Sherwood SW, Baldwin DT, Godowski PJ, Wood Wl, Gurney AL, Hillan KJ, Cohen RL, 
Goddard AD, Botstein D and Ashkenazi A. (1998) Genomic amplification of a decoy receptor 
for Fas ligand in lung and colon cancer. Nature 396(6712): 699-703. 

Pennica D, Swanson TA, Welsh JW, Roy MA, Lawrence DA, Lee J, Brush J, Taneyhill LA, 
Deuel B, Lew M, Watanabe C, Cohen RL, Melhem MF, Finley GG, Quirke P, Goddard AD, 
Hillan KJ, Gurney AL, Botstein D and Levine AJ. (1998) WISP genes are members of the 
connective tissue growth factor family that are up-regulated in wnt-1 -transformed cells and 
aberrantly expressed in human colon tumors. Proc. Natl. Acad. ScL USA. 95(25): 14717- 
14722. 

Yang RB, Mark MR, Gray A, Huang A, Xie MH, Zhang M, Goddard A, Wood Wl, Gurney AL 
and Godowski PJ. (1998) Toll-like receptor-2 mediates lipopolysaccharide-induced cellular 
signalling. Nature 395(6699): 284-288. 

Merchant AM, Zhu Z, Yuan JQ, Goddard A, Adams CW, Presta LG and Carter P. (1998) An 
efficient route to human bispecific IgG. Nature Biotechnology 1 6(7): 677-681 . 

Marsters SA, Sheridan JP 4 Pitti RM, Brush J, Goddard A and Ashkenazi A. (1998) 
Identification of a ligand for the death-domain-containing receptor Apo3. Current Biology 8(9): 
525-528. 

Xie J, Murone M, Luoh SM, Ryan A, Gu Q, Zhang C, Bonifas JM, Lam CW, Hynes M, 
Goddard A, Rosenthal A, Epstein EH Jr. and de Sauvage FJ. (1998) Activating Smoothened 
mutations in sporadic basal-cell carcinoma. Nature. 391(6662): 90-92. 

Marsters SA, Sheridan JP, Pitti RM, Huang A, Skubatch M, Baldwin D, Yuan J, Gurney A, 
Goddard AD, Godowski P and Ashkenazi A. (1997) A novel receptor for Apo2L/TRAIL 
contains a truncated death domain. Current Biology. 7(12): 1003-1006. 

Hynes M, Stone DM, Dowd M, Pitts-Meek S, Goddard A, Gurney A and Rosenthal A. (1997) 
Control of cell pattern in the neural tube by the zinc finger transcription factor Gli-1. Neuron 
19:15-26. 

Sheridan JP, Marsters SA, Pitti RM, Gurney A., Skubatch M, Baldwin D, Ramakrishnan L, 
Gray CL, Baker K, Wood Wl, Goddard AD, Godowski P, and Ashkenazi A. (1997) Control of 
TRAIL-lnduced Apoptosis by a Family of Signaling and Decoy Receptors. Science 277 
(5327): 818-821. 
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Goddard AD, Dowd P, Chernausek S, Geffner M, Gertner J, Hintz R, Hopwood N, Kaplan S, 
Plotnick L, Rogol A, Rosenfield R, Saenger P, Mauras N, Hershkopf R, Angulo M and Attie, K. 
(1997) Partial growth hormone insensitivity: The role of growth hormone receptor mutations in 
idiopathic short stature. J. Pediatr. 131: S51-55. 

Klein RD, Sherman D, Ho WH, Stone D, Bennett GL, Moffat B, Vandlen R t Simmons L, Gu Q, 
Hongo JA, Devaux B f Poulsen K, Armanini M, Nozaki C t Asai N, Goddard A, Phillips H, 
Henderson CE, Takahashi M and Rosenthal A. (1997) A GPI-linked protein that interacts with 
Ret to form a candidate neurturin receptor. Nature, 387(6634): 717-21. 

Stone DM, Hynes M, Armanini M t Swanson TA, Gu Q, Johnson RL, Scott MP, Pennica D, 
Goddard A, Phillips H, Noll M, Hooper JE, de Sauvage F and Rosenthal A. (1 996) The 
tumour-suppressor gene patched encodes a candidate receptor for Sonic hedgehog. Nature 
384(6605): 129-34. 

Marsters SA, Sheridan JP, Donahue CJ, Pitti RM ( Gray CL, Goddard AD, Bauer KD and 
Ashkenazi A. (1996) Apo-3, a new member of the tumor necrosis factor receptor family, 
contains a death domain and activates apoptosis and NF-kappa p. Current Biology 6(12): 
1669-76. 

Rothe M, Xiong J, Shu HB, Williamson K, Goddard A and Goeddel DV. (1996) l-TRAF is a 
novel TRAF-interacting protein that regulates TRAF-mediated signal transduction. Proc. Natl. 
Acad. Sci. USA 93; 8241-8246. 

Yang M, Luoh SM, Goddard A, Reilly D, Henzel W and Bass S. (1996) The bglX gene 
located at 47.8 min on the Escherichia coli chromosome encodes a periplasmic beta- 
glucosidase. Microbiology 1 42: 1659-65. 

Goddard AD and Black DM. (1996) Familial Cancer in Molecular Endocrinology of Cancer. 
Waxman, J. Ed. Cambridge University Press, Cambridge UK, pp.1 87-21 5. 

Treanor JJS, Goodman L, de Sauvage F, Stone DM, Poulson KT, Beck CD, Gray C, Armanini 
MP, Pollocks RA, Hefti F, Phillips HS, Goddard A, Moore MW, Buj-Bello A, Davis AM, Asai N, 
Takahashi M, Vandlen R, Henderson CE and Rosenthal A. (1996) Characterization of a 
receptor for GDNF. Nature 382: 80-83. 

Klein RD, Gu Q, Goddard A and Rosenthal A. (1996) Selection for genes encoding secreted 
proteins and receptors. Proc. Natl. Acad. Sci. USA 93: 7108-7113. 

Winslow JW, Moran P, Valverde J, Shih A, Yuan JQ, Wong SC, Tsai SP, Goddard A, Henzel 
WJ, Hefti F and Caras I. (1995) Cloning of AL-1, a ligand for an Eph-related tyrosine kinase 
receptor involved in axon bundle formation. Neuron 14: 973-981 . 

Bennett BD, Zeigler- FC, Gu Q, Fendly B, Goddard AD, Gillett N and Matthews W. (1995) 
Molecular cloning of a ligand for the EPH-related receptor protein-tyrosine kinase Htk. Proc. 
Natl. Acad. ScL USA 92: 1866-1870. 

Huang X, Yuang J, Goddard A, Foulis A, James RF, Lernmark A, Pujol-Borrell R, 
Rabinovitch A, Somoza N and Stewart TA. (1995) Interferon expression in the pancreases of 
patients with type I diabetes. Diabetes 44: 658-664. 

Goddard AD, Yuan JQ, Fairbairn L, Dexter M, Borrow J, Kozak C and Solomon E. (1995) 
Cloning of the murine homolog of the leukemia-associated PML gene. Mammalian Genome 
6: 732-737. 
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Goddard AD, Covello R, Luoh SM, Clackson T, Attie KM, Gesundheit N, Rundle AC, Wells 
JA, Carisson LMTI and The Growth Hormone Insensitivity Study Group. (1995) Mutations of 
the growth hormone receptor in children with idiopathic short stature. N. Engl. J. Med. 333' . 
1093-1098. 

Kuo SS, Moran P, Gripp J, Armanini M, Phillips HS, Goddard A and Caras IW. (1994) 
Identification and characterization of Batk, a predominantly brain-specific non-receptor protein 
tyrosine kinase related to Csk. J. Neurosci. Res. 38: 705-715. 

Mark MR, Scadden DT, Wang Z, Gu Q, Goddard A and Godowski PJ. (1994) Rse, a novel 
receptor-type tyrosine kinase with homology to Axl/Ufo, is expressed at high levels in the 
brain. Journal of Biological Chemistry 269: 10720-10728. 

Borrow J, Shipley J, Howe K, Kiely F, Goddard A, Sheer D, Srivastava A, Antony AC, 
Fioretos T, Mitelman F and Solomon E. (1994) Molecular analysis of simple variant 
translocations in acute promyelocyte leukemia. Genes Chromosomes Cancer 9: 234-243. 

Goddard AD and Solomon E. (1993) Genetics of Cancer. Adv. Hum. Genet 21: 321-376. 

Borrow J, Goddard AD, Gibbons B, Katz F ? Swirsky D, Fioretos T, Dube I, Winfield DA, 
Kingston J, Hagemeijer A, Rees JKH, Lister AT and Solomon E; (1992) Diagnosis of acute 
promyelocyte leukemia by RT-PCR: Detection of PML-RARA and RARA-PML fusion 
transcripts. Br. J. Haematol. 82: 529-540. 

Goddard AD, Borrow J and Solomon E. (1992) A previously uncharacterized gene, PML, is 
fused to the retinoic acid receptor alpha gene in acute promyelocytic leukemia. Leukemia 6 
Suppl3: 117S-119S. 

Zhu X, Dunn JM, Goddard AD, Squire JA, Becker A, Phillips RA and Gallie BL. (1992) 
Mechanisms of loss of heterozygosity in retinoblastoma. Cytogenet. Cell. Genet. 59: 248-252. 

Foulkes W, Goddard A. and Patel K. (1991) Retinoblastoma linked with Seascale [letter]. 
British Med. J. 302: 409. 

Goddard AD, Borrow J, Freemont PS and Solomon E. (1991) Characterization of a novel zinc 
finger gene disrupted by the t(15;17) in acute promyelocytic leukemia. Science 254: 1371- 
1374. 

Solomon E, Borrow J and Goddard AD. (1991) Chromosomal aberrations in cancer. Science 
254: 1153-1160. 

Pajunen L, Jones TA, Goddard A, Sheer D, Solomon E, Pihlajaniemi T and Kivirikko Kl. 
(1991) Regional assignment of the human gene coding for a multifunctional peptide (P4HB) 
acting as the p-subunit of prolyl-4-hydroxylase and the enzyme protein disulfide isomerase to 
17q25. Cytogenet Cell. Genet 56: 165-168. 

Borrow J, Black DM, Goddard AD, Yagle MK, Frischauf A.-M and Solomon E. (1991) 
Construction and regional localization of a Not\ linking library from human chromosome 17q. 
Genomics 10: 477-480. 

Borrow J, Goddard AD, Sheer D and Solomon E. (1990) Molecular analysis of acute 
promyelocytic leukemia breakpoint cluster region on chromosome 17. Science 249* 1577- 
1580. 
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Myers JC, Jones TA, Pohjolainen E-R, Kadri AS, Goddard AD, Sheer D, Solomon E and 
Pihlajaniemi T. (1990) Molecular cloning of 5(IV) collagen and assignment of the gene to the 
region of the region of the X-chromosome containing the Alport Syndrome locus. Am. J. Hum. 
.Genet 46: 1024-1033. 

Gallie BL, Squire JA, Goddard A, Dunn JM, Canton M, Hinton D, Zhu X and Phillips RA. 
(1990) Mechanisms of oncogenesis in retinoblastoma. Lab. Invest 62: 394-408. 

Goddard AD, Phillips RA, Greger V ( Passarge E, Hopping W, Gallie BL and Horsthemke B. 
(1990) Use of the RB1 cDNA as a diagnostic probe in retinoblastoma families. Clinical 
Genetics 37: 117-126. 

Zhu XP, Dunn JM, Phillips RA, Goddard AD, Paton KE, Becker A and Gallie BL (1 989) 
Gfrmline, but not somatic, mutations of the RB1 gene preferentially involve the paternal 
allele. Nafure 340: .312-314. 

Gallie BL, Dunn JM, Goddard A, Becker A and Phillips RA. (1988) Identification of mutations 
in the putative retinoblastoma gene. In Molecular Biology of The Eve: Genes, Vision and 
Ocular Disease . UCLA Symposia on Molecular and Cellular Biology, New Series, Volume 88. 
J. Piatigorsky, T. Shinohara and P.S. Zelenka, Eds. Alan R. Liss, Inc., New York, 1988, pp. 
427-436. 

Goddard AD, Balakier H, Canton M, Dunn J, Squire J f Reyes E, Becker A, Phillips RA and 
Gallie BL. (1988) Infrequent genomic rearrangement and normal expression of the putative 
RB1 gene in retinoblastoma tumors. Mol. Cell. Biol. 8: 2082-2088. 

Squire J, Dunn J, Goddard A, Hoffman T, Musarella M, Willard HF, Becker AJ, Gallie BL and 
Phillips RA. (1986) Cloning of the esterase D gene: A polymorphic gene probe closely linked 
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We have enhanced the polymerase chain 
reaction (PGR) such that specific DNA 
sequences can be detected without open- 
ing the reaction tube* This enhancement 
requires the addition of ethidium bromide 
(EtBr) to a PCR. Since die fluorescence of 
EtBr increases in the presence of double* 
stranded (ds) DNA an Increase in fluores- 
cence in such a PGR indicates a positive 
amplification, which can be easily moni- 
tored externally* In fact, amplification can 
be continuously monitored in order to 
follow its progress. Hie ability to simulta- 
neously amplify specific DNA sequences 
and detect the prodnct of the amplification 
both simplifies and improves PGR and 
may facilitate its automation and more 
widespread use in the clinic or in other 
situations requiring high sample through- 
put 

Although the potential benefits of VCR 1 to clin- 
ical diagnoses arc well kiidwni 2 ' 5 , it U still not 
widely used m this setting, even though it is 
fau.r year* tinea thcra*ost*bl« DMA polymer- 
ases* made PGR twractical. Some of the reasons' for its slow, 
acceptance are high cost, tack of automation of pre-* and 
post- PCR processing steps, and false positive results, from 
carryovcT-contamination, The fim two points arc related 
in that labor is the largest contributor to cost at the present 
stage of PCR development* Most current assays require 
aornc f or ra of "downstream" processing once tbermocy* 
ding is done in order io determine whether the target 
DMA sequence was- present and has amplified. These 
tndude DNA hybridi7^don 5,6 , gel efcttropbcTOts with or 
without use of restriction digestion*'* HPLC?, or capillary 
electrophoresis 10 . These methods are labornntensc, have, 
low throughput, and arc difficult to automate. The third 
point is also closely related to downstream processing. 
The handling of the PCR product in these downstream 
processes increases the chances that amplified DNA . wilt 
spread through the typing- lab, resulting in -a risk of 



"carryover" false positives in subsequent testing". 

These downstream processing steps would be elimi- 
nated if specific amplification and detection of amplified 
DNA took place simultaneously within an unopened re- 
action vessel Assays m which such different processes take 
place without, the need to separate reaction components 
have been termed "J^mogeneous* 1 . No truly homoge- 
neous PCR assay has been demonstrated to date, although 
progress towards this end has been reported. Chehab, et 
al. 1 * developed a PCR product detection scheme using 
fluorescent primers that resulted in a fluorescent PCR 
product Allck-specific primers, each with different fluo- 
rescent tags, were used to indicate the genotype of the 
DNA. However, the unincorporated primers must still be 
removed in a do wnstream process in order to visualize the 
result Recently, Holland, et al ls , developed an assay in 
which the endogenous 5 r exouudease assay of Taq DNA 
polymerase was exploited to cleave a labeled oligonucleo- 
tide probe. The probe would only dcave if PCR amplify 
cation had produced its coroptemeataiy sequence. In 
order to detect the dcavage products, however* a subse- 
quent process is again needed. 

We have developed a truly homogeneous assay for PCR 
and PCR product detection based upon tbc gready in- 
creased fluorescence that e^hidxura btoinide and other 
DNA binding dyes exhibit when they are bound to. ds- 
DNA t4: ~ ie . As outlined in Figure 1; a prototypic PCR 



iPCRordfO 1 



/ 



ssDNA 



(op topginxniatt) 




1 Prindplc of simuitancoua amplification and- detection of 
PCR product: The component* of a PCR containing EtBr that aro 
fluorescent are listed— EtBr itself, EtBr bound to other ssDNA or 
daDN A, There ht a lar^c fijiorescencc enhancement when Etlir is 
bound to DNA and htndih> ta gi*cady enhanced when DNA is 
double-nrandcd. After sufficient (n). cydes of PCR, the net 
increase m dsDNA residts in additional EtBr binding, and * net 
increase in total (luorcsccncc: 
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WOTttGeldcctrophoresis of PCll salification prcdiicfc of the 
human, mtdcar gene, HLA DQti, made in the presence of 
increasing amounts of EtBr (up to 8 Hg/ml). The presence of 
EtBr has no obvious effect on she yield or specificity of amplifi- 
cation. 
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HOOTS ft (A) fluorescence measurements froth PCRs that contain 
0.5 |bgfai3 EtBr and thai are specific for Y^hrotnotoxbc repeat 
ttooence*. Five replicate FCRs *ere begun containing each of the 
DNA* specific. At each indicated cycle, one of the five replicate 
PCks for each DNA was removed from thcrmocyding and it* 
fluorescence measured. Unit* of fiuor««ccnc* are artmrary. (B) 
UV photography of PGR tube* (0.5 nil Eppcndarkstylc, pofvpro* 
pyicrye microcentrifuge :tubes) containing reactions, those start- 
ing from Z ng male DNA and control reactions without any DNA, 
from (A). 



begins with primers that are single-stranded DNA (ss- 
DNA)» ciNTPs, and DNA polymerase! An amount of 
dsDNA containing the target sequence (target DNA) h 
also typically present. This amount can vary, depending 
on the application, from single-cell amounts of DNA 17 to 
micrograms per PCR^ 6 , If EtBr is present the reagents 
that will fluoresce, in order of mcrcasittf fluorescence, are 
free EtBr itself, and EtBr bound to the single-stranded 
DNA primers ami to the double-stranded target DNA (by 
its intercalation between the stacked bases of the DNA 
doubic-hefiat). After the first denatu ration cycle, target 
DNA will be largely single-stranded. After a PGR is 
completed, the most significant change as the increase in 
the Amount of dsDNA (the PGR product itself) of up .to 
several micrograms. Formerly free EtBr is bound to the 
additional dsDNA* resulting in an increase m fluores- 
cence. There is also some decrease in the amount of 
ssDNA primer, but because the binding of EtBr to ssDNA 
ts much less than to dsDNA, che efTect of this change on 
the total fluorescence of the sample is smalL The fluores- 
cence (increase can be measured by directing exekation 
illumination through the walls of the amplification vessel 



before and after, or even coniiouously during, thermocy- 
RESULTS 

PCK in the presence of EtBr- In order to assess th e 
affect of EtBr in PGR, amplifications of the human HLft 
DQa gene* 9 were performed with the dye present at 
concentrations from 0.06 to 8.0 u.gfaru (a ly&icaj concen- 
tration of EtBr used in staining of nucleic acids following 
gel electrophoresis is 0.5 pgfmf). As shown in Figure 2, gel 
electrophoresis revealed little or no difference in the yield 
or quality of the amplification product whether' EtBr was 
absent or present at any of these concentrations, indicat- 
ing that EtBr does not inhibit PGR, 

Detection of human Y-chromewra«j specific 
«pences* Sequence-specific, fluorescence enhancement of 
EtBr as a result of PGR was demonstrated in a series of 
amplifications containing 0.5 u.g/ml EtBr and primers 
fipeoftc to repeat DNA sequences found on the human 
V-chromosomc 80 . These PCRs initially contained either 
60 ng male, 60 ng female, 2 ng roak human or no DNA. 
Five replicate PCRs were begun for each DNA* After 9 y 
1?) ,21 , 24 and 29 cycles of therniocycling, a PCR for each 
DNA was removed from the thermocycler, and its, fluo- 
rescence measured in a spectroflnorometex and plotted 
vs. amplification cyde number (Fiff. 3 A). The shape of th» 
curve tcHccts the fact that by the time an increase in 
fluorescence can be detected, the increase in DNA U 
becoming linear and not exponential with cycle number; 
As shown, the fluorescence increased about three-fold 
over the background fluorescence for the PCRs contain* 
ing human male DNA, but did not significantiy increase 
for negative control PCRs, which contained either no 
DNA or human female DNA- The more male DNA 
present to begin with— 60 ng versus S ng— the fewer 
cycles were needed to give a detectable increase in fluo- 
rescence. Gel electrophoresis oo the products of these 
amplifications showed that DNA fragments of the ex- 
pected sue were made in the male DNA containing 
reactions and that Hide DN A synthesis took place in the 
control samples. 

In addition, the increase in. fluorescence was visualized 
by simply laying the completed, unopened PCRs on a UV 
transuluTTunator and photographing them through a red* 
niter. This is shown in figure SB ibr the reactions thai 
began with 2 ng male DNA and those with no DNA* 

Detection of specific allele* of the human fi-globin 
gene. In order to demonstrate that this approach has 
adequate specificity to allow genetic screening, a detection 
of the skkle-cell anemia mutation was performed* Figure 
4 shows the ftuoteseencc from completed amplification* 

containing EtBr (0.5 ^Qftrd) a* detected by photography 

of the reaction cubes on a UV transiUuminator. These 
reactions were performed using primers specific for ci- 
ther the. wild-type or sickk-cell mutation of the human 
^globin gene". The specificity for each allele is imparted 
by placing the sickle-mutation site at the terminal V 
nucleotide of one primer. By using an appropriate prkntr 
annealing temperature* primer extensiorwand thus an> 
pU6eat»n--can take place only if the 3' nucleotide of the 
primer i$ complementary to the p-globin allele present"^ 2 . 
. Eadi^ajrofamplificauong shown in Figure 4 consists of 
a reaction with either the wild-type allele spedfk (left 
tube) or skJdc-aUeie specific (right tube) primers. Three 
different DNAs. were typed: DNA from a homozygous, 
wild-type p-globin individual (A A); from a heterozygous 
sickle p~gfpbin individual (AS); and from a homozygous 
sickle ^globb mdividual'(SS). Each DNA (50 ng genomic 
DNA to start each PGR) was analyzed m triplicate (3 pairs 
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c f reactions each). The DNA .type vas reflected in the 
relative fluorescence intensities in each pair of completed 
flfn p]t£cHtioiifi. There was a significant increase in fluores* 
p n oc onl J where a p~globin allele DNA matched the 

!>rii»cf act, When measured on a spcctrc&noroinctcr 
data not shown), this fluorescence was about three tines 
diat present in a FCR where both p-globm alleles were 
jnbmatchcd to the primer set. Gel clcorpphofesk (not 
dhown) e$ral>Hshcd that this increase in fluorescence was 
due to the synthesis of nearly a microgram of a DNA 
fragment of the expected size for 0-globin. There was 
jitdc syntHttifi of dsDNA in reactions in. which the aliele- 
jtpedne primer was mismatched to both alleles* 

Continuous inowtoriog of a PCR. Using a fiber optic 
device? H is possible to direct excitation illumination from 
n spectrofl uoromete r to a PGR undergoing thcrmocycling 
and to return its fluorescence to the Rpcctroftuion>f»ctcr. 
The fluorescence readout of such an arrangement, di- 
rected at an EtBr-containing amplification of Y-chromo- 
wme specific sequences frorn 25 ng of human male DNA* 
is shown in Figure 5. The readout from a control fCR 
trfih no target DNA is also shown. Thirty cycles of PGR 
verc monitored for each. 

The fluorescence trace as a function of time dearly 
shows the effect of the thermocyding, Fluorescence inten- 
sity rises and, fails inversely with temperature. The fluo- 
rescence intensity is minimum at the denaturadon tem- 
perature (*rt°C) and raaxiiiium attheanneaUWextension 
temperature (50°C>- In the negative-control PCR, these 
fluorescence maxima and minima do not change signifi- 
cantly over the thirty tbertaocyde^ indicating that there is 
Uttlc dsDNA synthesis without the appropriate target 
DNA, and there b little if any bleaching of EtBr during 
the continuous illumination c*f the sample. 

In the PGR containing male DNA, the fluorescence 
maxima at the annealing/extension temperature begin to 
increase at about 4000 seconds of thennocyding, and 
continue to increase whb time, indicating that dsDNA is 
being produced at a detectable level. Note that the fluo- 
rescence minima at the denaturaoon temperature do not 
significantly increase, presumably because al thb temper- 
ature there is no dsDN A for EtBr to bind. Thus the course 
of the amplification is followed by tracking the fluores-. 
cence increase at the annealing temperature. Analysis of 
ihc products of these two amplifications by gelclcctropbo- 
news sho"wed a DNA fragment of the eiqpectcd size for the 
male DNA containing sample and no detectable DNA 
synthesis for the control sample. 

DISCUSSION 

Downstream processes such as hybridization to a se- 
qucncenApedfic probe can enhance die specificity of DMA 
deicuiuu by PGR. The cKnxiratkin <y€ these proccsaco 
means that* the specificity of this homogeneous assay 
depends solely on that of rCtL In the case of sickie-celi 
dLicase, wc have shown that PGR. alone has sufficient DNA 
sequence apecifiaLy to permit genetic screening, Using 
appropriate amplification conditions, there is little non* 
specific production of ckDNA in the absence of the 
appropriate target allele. 

The specificity required to dctxxt pathogens can be 
more or less than that required* to do genetic screening, 
depending on the number of pathogens in the sample and 
the amount of other DNA that must be taken with the 
sample. A difficult target is HIV, which requires detection 
of a viral genome that can be at the level of a few copies 
per thousands of host cells 1 *, Comoared with genetic 
screening, which is performed on ceils containing at least 
one copy of die target sequence, HIV ideteetion requires 
both more specificity and the input of more total 
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BGWfc 4 UV photography .of PCfc tubes containing axxwU^omioiu 
tisiog Eti&r trot are specific to wioVtjpc (A) or ricWc (5) alleles of 
the human £-globin gene. The left of each pair of tubes contains 
allek-»pcdfic primers to the wild-type alleles, the right mbc 
primers to the sicWe attek. The phmograph was tafceh after 50 
cycics of PGR, and Che input DNAs and the alleles they contain 
arc indicated. Fifty i»g of DNA was used to bCpn PGR. Typmg 
was done m triplicate (3 pair* of PO&) for each input DNA: 
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nCOBS 5 Continuous, real-time monrtoring of a PC R. A fiber optic 
was roc*! to <»rry. excitation light to a PCR m progress and also 
emitted light back to a fluoromctcr (see Experimental PnKocoty 
Amplification wins human m&lo-DNA specific primers fn ft PCR 
starting with 20 ng of human male DNA <top), or in a control 
PGR without DNA (bottom), were monhored. Thirty cvdej of 
PGR were foRowed for each. The' temperature cycled between 
94*C (denaturAtxcm) and 50*C (annealing and extension). Note in 
the male DNA. PC&,the cycle (time) dependent increase in 
fluorescence at the anneafin^exteDaion temperature, 
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DNA— up to microgram anioun&™-ia order to have suf- 
ficient numbers of target sequences. This large amount of 
starting DMA m an amplification sic&i&c^Ticly increases 
tht background fluorescence over which any additional 
fluorescence ptoduced by PGR must be detected. An 
additional complication that occurs with targets in; low 
copy-number is the formation of the ^rimet-dhner" 
artifact. This is the result of die extension of one primer 
using the other primer as a template. Although this occurs 
infrequently* once it occurs the extension product is a 
substrate for PCR amplification* and can compete wtth 
true PCR targets if those targets are rare. The primer- 
d ww product is of course dsDNA and thus U a potential 
source of false signal in this homogeneous assay. 

To increase PCR specificity W reduce the effect of 
primer-dimcT amplification, we are investigating a num- 
bcr of apjpraaches, including die use of nested-primer 
amplifications diat take place in a single tube 8 , and the 
"hot-start", in which nonspecific amplification is reduced 
by raising the temperature of the reaction before DNA 
synthesis begins 25 . Prdhninary results using these ap- 
proaches suggest tbatpruncr-diracT is effectively reduced 
and it is possible to detect the increase in Etfir fluores- 
cence in a PCR instigated by a single HIV genome in a 
background of 10* celts. With larger numbers of cells, the 
background fluorescence contributed by genomic DNA 
becomes probteinatie- To reduce this background, it may 
be possible to use sequence-specific DNA-binding dyes 
that can be made to preferentially bind PCR product over 
genomic DNA by incorporating the dye-binding DNA 
sequence into the PCR product through a 5' *add-on n to 
the oligonucleotide primer* 1 . 

We have shown that the detection of fluorescence 
generated by an EtBr-containing PCR is straightforward, 
both once PCR is completed and continuously during 
UierniocycKng. The ease with which automation of spe- 
cific DNA detection can be accomplished is the most 
pro mking aspect of this assay. The fluorescence analysis 
of completed PCR* is ahcadypossibk with existing instru- 
roentauon in 96-weJI format**. In this format, the fluores- 
cence in each PCR can be cjuantitated before, after, and 
even at selected points during t±errnocyciin.g by moving 
the rack of PCRs to a 96-mictowcH plate fluorescence 
reader 40 , 

The instrumentation necessary to continuously monitor 
multiple PCRs simultaneously is also simple in principle. 
A dh-cct c* tension of the apparatus used here is to have 
multiple fiberoprJcs transmit the excitation light and flu- 
orescent emissions to and from multiple PCRs. The ability 
to monitor multiple PCRs continuously may allow quan- 
titation of target DNA copy number, figure 3 shows that 
the larger the amount of starting target DNA, the sooner 
during- PfXR a fluorescence increase is detected. Prelimi- 
nary experiments <Biguchi and DoHinger, manuscript in 
preparation) with continuous monitoring have shown a 
sensitivity to two-fold differences in initial target DNA 
concentration. 

Conversely, if the number of target molecules is 
known — as* it can be in genetic screeriing-rcontinuous 
monitoring may provide a means pf detecting false posi- 
tive and false negative results. With a known number of 
target molecules, a true positive would exhibit detectable 
fluorescence by a predictable number of cycles of PCR. 
Increases in fluorescence detected before or after that 
cycle would indicate potential artifacts* False negative 
resuks due to, for example,. inhibition of DNA potymcr- 
ase, may be detected by including within each PCR an 
inefficiently amplifying marker. This marker results in a 
fluorescence increase only after a large number of cy- 
cles— many more than arc necessary to detect a true 
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positive- If a sample fails to hare a fluorescence increase 
after this many. cycles, inhibition may be suspected. Since, 
h> this assay, conclusions are drawn based on the presence 
or absence of fluorescence signal alone, such controls may 
be important. In any event, before any test based on this 
principle is ready for the chruc, an assessment of false 
positive/false negative rates will need to be obtained "using 
a large number of known samples. 

In nummary, the inclusion m PCR of dyes whose fluo- 
rescence is enhanced upon binding dsDNA makes it 
possible to detect specific DNA amplification from outside 
the PCR tube. In the future, instruments based upon this 
principle may facilitate the more widespread use of PCR 
in applications that demand the high throughput of 
samples. 

EXPERIMENTAL PROTOCOL 

Human HLA-DQ<m gene *mpKflcatiorks containing £u*x. 
PCRs were set up inlOO u 1 volumes containing 10 mM Tris-HCh 
pH 8.3; 50 mM KCJ; 4 raM MgO*: 2-5 units of too DNA 
polymerase (PerluiwgWr Genu. Norwalk^ CT); 20 pinole each 
of human HtA-DQa gene specific oligonucleotide primers 
(VH25 and CHS? 19 and approximately 10* copies of DQfr PCR 
product diluted from a previous reaction- Ethidium bromide 
(El Br; Sigtnn} was used at the concentrations indicated to Figure 
2* Thcrmocyding proceeded for 20 cycles in a model 480 
thcrmocydcr (Pcrkm-Dmcr Gctu*, Norwalk, CT) uang a "step- 
cycle^ promm of 94*C for I xnm.denaturaUon and 60*0 for$0 
sec unncaung and 72 *C for 30 sec. extension, 

Y-chromosomc specific PCR* PCRs (100 \il total reaction 
volume) containing QJ5 p&Anl EtBr were prepared as described 
for HLA-DQct, except with different primers and target DNAj. 
These PCRs contained J $ prnotc each male DNA-specifie prtmen 
YT.l and and cither 60 ng male, 60 eg female, 2 ng male, 

or no human DNA. Thennocyding ^sWCTpr 1 min- and WC 
for 1. inin uang a Vcp-cyde* p r og r a m. The number of cycles for 
a sample were as indicated in Ffguve 3. fluorescence measure- 
ment u described below. 

Allcfc-apccrfic, human £-glotnn g^u* PCR, Amplifications of 
100 y-1 volume vtmg 0 5 ^Lg/tnl of EtBr were prepared a? 
described for HLA4>Q& above except with different primer* and 
target DNAfi. These PCRs combined eilber primer pair HOPS/ 
HgMA <vBd-type globin speetSi primers) or HOP2/llpHS fs«k- 
-lc-globin speafic primers) at 10 pinole Sach primer per PCR, 
These primers were developed by Wu ct aL 2 \ Three different 
tACgei DNAa were tutcd in separate amplifkatkrasr— 50 ng cacU of 
human DNA that was homozygous for the *ieMc trait (3S)« DMA 
that was hcterDryrovw for the sickle traK <AS)» or DNA that was 
homozygous for the w.c- £lobin (AA). Thermocyding wa> for 30 
cycles at 94*0 for 1 mm. and 55*0 Cor 1 min. using n ,, stcp-cYCi<f* , 
program. An annealing temperature of SS^C had been .shown by 
Wu et al 2J w provide, allclc-epcdfic atppliftsation. Completed 
PCRs were photographed through a red filter (Written 
after placing the reaction tubes atop a model TM-36 transiHurfli- 
natof (UV-productS Sah' Gabriel, CA). 

Fluo< es«n ce wieaflnremettt Fluorew»ee measurement^ were 
txiade on PCRs contoininj; EtBr in a Fluorolog*2 Euoromctcr 
CSPEX, Edisyn, NJ). Excitation was at the 500 nm band with 
atout 2 nm bandwidth with a GO 455 nm cut-off filler jMclles 
Crist, Inc., Irvine. CA) to exclude secctnd*order light. Emitted 
light was detected a( 570 nm with a bandwidth of about 7 nm. An 
OG 630 om cut-off filter was used to remove the exchauon hght* 

ContitatouA ftnorescenee mo ni tor i ng of PCR, Continuous 
monitoring of a PCR in progress was accomplished using the 
Epcctrofiuoromebct And settings described Above as well as a 
fiberoptic accessory (SP£X cat, no, 1950)- to both send excitation 
light to. and receive emitted light from, a PCR placed in a well of 
a model 480 ihermocydcr (Pcrkin-Elmer Celus). The probe end 
of the fiberoptic cable was attached with K 5 ovm.ute-cpco^ * to the 
open top of a PCR tube (a O.o ml polypropyJenc crntrifigc tube 
with its cap removed) effectively scaling il The crtposed/top of 
the PCR tube and the end of the fiberoptic caWe were shielded 
from Toom light and the room ljgtttt were kept dimmed during 
• each nm. The monitored PCR was an amplication of Y-d>TO- 
mos6me-6pcdnc repeat sequences as described above, except 
u$mg\an anncahneyextension cerxiper&uirc ofSOX. The reaction 
was covered with mineral oil (2 drops) to prevent evaporation. 
Therroccyclmg' and* fiuoresocncc measurement were started si' 
multancously. A time-base scan with a 10 second integradop' thnc 
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km uwjd and the emtettOB stgnnl was ratioed to'tbc excitation 
nigOJtl to control lot changes in light-iourec intensity, feita.wcre 
^Slcci^d using the droSOOOf, version 15 (SVEX) data systcm. 
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IMMUNO BIOLOGICAL LABORATORIES 



SCD-14 EUSA 

Trauma, Shock and Sepsis 




The CD-I 4 molecule is' expressed on the surface. of 
rnonocytes and some macrophages. Membrane- 
bound CD-u is a receptor for lipopotysaccharide 
(LPS) complexed to LPS-8inding-Protein (LBP). The 
concentrailon of Its soluble form is aftered under 
certain pathological conditions. There, is evidence for 
an important rote of sCD-14.wfth polytrauma, sepsis, 
burnings and inflammations. 
During septic conditions and acute infections ii seems 
to be a prpgnostfc marKer and is therefore of value in 
monitoring these partients. 



IBL offers an ELISA for quantitative determination of 

soluble CD-14 in human serum, -plasma, cell-culture 

supernatants and other biological fluids. 

Assay features; 12x8 determinations 
(microSter strips^ 
precoated with a specific 
monoclonal antibody, 
2x1 hour incubation, 
standard range: 3-96 ng/mi 
detection limit: 1 ngAnl 
CV: intra- and tnterassay < 8% 



For more information call or fax 



GESELLSCHAFT FOR IMMUNCHEMIE U N D~ B I OLOGiFmbh 

OSTERSTRASSE 86 -D- 2000 HAMBURG 20 -GERMANY TEL. +40/491 00 61-64 • FAX +40/40 1198 
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SIMULTANEOUS AMPLIFICATION AMD DETECTION Of 
SPECIFIC DMA SEQUfHCES 

fcussdl Higuchi*, Gavin DoiliBger 1 , P- Scan Walsh and Robert Griffith 

Roche tfotccular Systems, Inc.. 1400 55rd St., Emeryvttte, CA 94608- *C!»iron Corporation, 1400 53rd St, ErtUtfyviflc, CA 
^Corresponding author. 



We Iiave enhanced the polymerase; chain 
t eactian (PGR) such that specific DNA 
sequences can be detected without open- 
ing the reaction tube, This enhancement 
requires the addition of ethidium bromide 
(EtBr) to a PGR- Since the fluorescence of 
EtBr increases in the presence of double* 
stranded (d$) DNA w increase in fluores- 
cence in such a PGR indicates a positive 
amplification, which can be easily moni- 
tored externally. In fact, amplification can 
be continuously monitored in order to 
follow its progress. Tile ability to simulta- 
neously amplify specific DNA sequences 
and detect the product of the amplification 
both simplifies and improves PGR and 
may facilitate its automation and more 
widespread use in the clinic or in other 
situations requiring high sample through- 
put 

Although the potential benefits of rCR* to, clin- 
ical d&gnc*tics arc welt kiidwv 3 ' 5 , it is still not 
widely used in this setting, even though it is 
fnxxt- year* tinea thermostat* F^7 rn ^ r * 
aae$ 4 made PCR practical. Some of the reasons for Its slow 
acceptance are high cost, tack of automation of pre* and 
post-PCR processing steps, and false positive results, from 
carryover-contamination. The first two points arc related 
in that labor is the largest contributor to cost ait the present 
stage of PCR development.* Most current assays requite 
some form of "downstream" processing once tbermocy* 
cling ifi done in order to determine whether the target 
DNA sequence was present and has amplified. These 
include DNA hybrkJisation*"* gel ekefcropboreus with or 
without use of restriction digestion*: 0 , HPLC?, or capillary 
dectrophorcsv 10 . These methods are labor-intense, haTe. 
bw throughput, and axe difficult to automate. The third 
point is also closely related to downstream processing. 
The handling of the PCR product in these downstream 
processes increases the chances that ainpftfjed DNA witt 
spread through the typing ■ lab, resulting in • a .risk of 



carryover" false positives in subsequent testing". 
These downstream processing steps would be elinti- 
nated if specific amplification and detection of amplified 
DNA took place simultaneously within an unopened re- 
action vessel Assays in whieh such different processes take 
>lace without, the need to separate reaction components 
iave been termed *^mogeheous"'. Wo truly homoge- 
neous PGR assay has been demonstrated to date, although 
progress towards this end has been reported. Chehab, et 
aL , developed a PCR product detection scheme using 
fluorescent primers that resulted in a fluorescent PCR 
product Allck-specific primers, each with different fluo- 
rescent tags, were used to indicate the genotype of the 
DNA. However, the unincorporated primers must still be 
removed in a downstream process in order to visuaHzc the 
result Recently, Holland, et al> 13 * developed an assay in 
which the endogenous 5 r -cxoBUclease assay of T*q DNA 
polymerase was exploited to cleave a labeled -oligonucleo- 
tide probe. The probe would only dcave if PCR ampli- 
cation had produced its complementary sequence. In 
order to detect the dcavage products, however, a subse- 
quent process w again needed, 

We have developed a truly homogeneous assay for PCR 
and PCR product detection based upon tbc greariy in- 
creased fluorescence that ethidium bromide and other 
DNA binding dyes exhibit when they arc bound to.ds- 
DNA t4: - l<? . As outHnccl in Figure 1, a prototypk PCR 
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ROVRE I Principle of simultaneous amplification and detection of 
PCR product The components of a PCR coatainh^ EtHr (bat are 
fluorescent are ftswd—EtBr itself, EtBr bound to other ssDN A or 
dsDN A. There vt a large ftiwrcscencc enhancement when EtBr Is 
bound to DNA and bmdink u greatly enhanced when DNA .is 
douhlc-yrzndcd. After sumdent (n). cydcs of PGR, the .net 
ificrca.se in d*£>NA resuks in- additional EtBr bjndBn^ and ? net 
incrcftsc in total •fluaresccnan 
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WTO ^ Gel elcctropltoresis of PClL am plifiotion prodiitti-of the 
tromm, nuclear gene, HLA DQti, mad* in the presence of 
SJ?^ amounts ofEtBr (up to 8 n-g/ml). The presence of 
EtBr has no obvious effect on the yield 0r spccrJidty of ampUfi- 
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ROTAS > (A) fluorescence rncasurcnncnU •ft'Ota PCRs that contain 
0.5 ^ui! ElBr and thai site specific for Y^hromoioroe repeat 
waoetioe*. Five replicate PCRs *ere begun confining each of the 
521* l pedfi t 1, At ,)Ba ™ Mk*** cycle, one of the five replicate 
PCRs for each DNA Vas removed from thcrmocydmg and tts 
fluorescence measured. Unit* of fluorescence are arbitrary. (B) 
UV photography of PCR tube* (0.5 nil Eppcndorf^tylc, Mypro* 
pylcne mtcro-cemiifuTO tubes) conumttag reactions, those start* 
ing from 2 ng male DNA And control reactions without any DNA. 
from (A), ' 



begins with primer* that are single-stranded DNA (ss- 
DNA), «ANTPs, and DNA polymerase; An amount of 
dsDNA containing the target sequence (target DMA) is 
also typically present. This amouat can vary, depending 
on the application, from single-cell amounts of DNA 17 to 
micrograms per PGR? 5 , If EtBr is present, the reagent* 
that will fluoresce, in order of increasing fluorescence, are 
free ElBr hscl£ t and EtBr bound to the stngle-fittandcd 
DNA primers and to the doublcsstnmded target DNA (by 
its intercalation between the stacked bases of the DNA 
dooblc-hcfix)* After the first denaturatibn cyde, target 
DNA will be largely single-stranded. After a PGR is 
completed, the most significant change is the increase in 
tbc amount of dsDNA (the PCR product itself) of up to 
several micrograms. Formerly free EtBr is bound to the 
additional dsDNA 4 resulting in an increase m fluores- 
cence. There is also some decrease in the amount of 
ssDNA primer, but because tbe binding of ElBr to ssDNA 
is much less than to dsDNA, the effect of uw change on 
the total fluomccricc of the sample is srnalL The flnorcs- 
cence increase can be measured by directing excharion 
illumination through the walls of the amplification vessel 
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before and after, or even continuously during, thermocy. 



RESULTS 

PCR in the presence of EtBr. In order to assess th e 
affect of ElBr in PCR, amplifications of the human Hl„A 
DQx gtne >9 were performed with the dye present at 
concentrations from 0,06 to 8.0 tLgfrnl (a typical concen- 
tration of EtBr used in staining of nudek acids following 
get ejectmphoresis is 0.5 u-g/rnf). As shown in Figure 2, g| 
electrophoresis revealed little or no dificrencc in the yield 
or quali ty of the amplification product whether EtBr wa* 
absent or present at any of these concentrations, indicat- 
ing that EtBr does not inhibit PCR, 

Detection of homan Y-chi ouiosomc specific 
qaences^ Sequence-specific, fluorescence enhancement of 
EtBr a* a result of PGR was demonstrated in a scries of 
amplifications containing 0,5 ug/ml EtBr and primer* 
specific to repeat DN'A sequences found on the human 
Y-chromosomo* 0 - These PCRs initially contained cither 
60 ng male. 60 ng female, 2 ng rrjafc human or no DNA. 
Five repKcate PCRs were begun for each DNA* After 0, 
17, 21 , 24 and 29 cycles of thermocydingj a PGR for cadi 
DNA was removed from the thermocycler, and its, fluo- 
rescence measured in a spectroflnorometer and plotted 
vs. amplification cyde number (fig. 3A), The shape of this 
curve rcSccts the fact that by the time an increase in 
fluorescence can be detected, the increase in DNA b 
becoming linear and not exponential with cycle number; 
As shown, the fluorescence increased about three-fold 
over the background fluorescence for the PCRs contain* 
ing human male DNA, but did not significantly increase 
for negative control PCRs, which contained either no 
DNA or human female DNA, The more male DNA 
present to begin with — 60 ng versus 2 ng-^he fewer 
cycles were needed to give a detectable increase in fluo- 
rescence. Gel electrophoresis oo the products of these 
amplifications showed that DNA fragments of the ex- 
pected msc were made in the male DNA containing 
reactions and that tiule DN A syathe$i$ took place in the 
control samples. 

In addition, the increase in. fluorescence was visualized 
by simply laying the completed* unopened PCRs on a UV 
transiliomiTiatOT and photographing them through a red 
Alter. This is shown in figure SB lor the reactions that 
began whh 2 ng male DNA and those with no DNA- 

Detection of specific allele* of the human fl-globm 
gene. In order to demonstrate that this approach has 
adequate specificity to allow genetic screening* a d&cction 
of the sickle-ceil anemia mutation was performed. Figure 
4 shows the fluorescence from completed amplications 

containing EtBr {OS Rgfcnl) as detected bf photography 

of the reaction tubes on a UV a^ansiUuminaior, These 
reactions were performed using primers specific for ci- 
ther the. wad-type or sickle-cell mutation of the human 
frglobin gene* 1 . The specificity for each, allele is imparted 
by placing the sickie-mutation site at the terrninaJ 3' 
nucleotide of one primer. By using an appropriate primer 
armealing temperature, primer extension— and thus an> 
ptifceation — can take place only if the 3' nucleotide of the 
primer i$ complementary to the $-globin alJcfc pi^scm^** 2 - 
Each pair of amoBficarions shown in Figure 4 consists of 
a reaction with either tbc wiM-typc allele «pedfic (left 
tube) or sicWc-allele specific (right tube) primers. Three 
different DN As were typed: DNA from a homozygous, 
wild-type £~globin individual (A A); from a heterozygous 
p^ipbin individual (AS); and from a homozygous 
sickle p-globb individual (SS). Each DNA (50 ng genomic 
DNA to start each PGR) was analyzed m pHcate (3 pairs 
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pf reactions each). The DNA .type tos reflected in the ' 
itjtfxvt fluorescence intensities in each pair of completed 
amplification*. There was a significant increase in fluores- 
ce oftly vrherc a f^gtobin aQele DNA matched the 
primer set When measured on. a spcctroflncronictcr 
[data not shown), this fluorescence was about three times 
dttt present in a FCR where both f^globhi alkies were 
mismatched to the primer set. Gel ckcaTopho-retfo (aot 
jKown) es&Misheti that thw increase in fluorescence vas 
due to trie synthesis of nearly a microgram of a DNA 
fragment of the expected size for p-globin. There was 
litdc synthesis of dsDNA in reactions in.which the alkie- 
spedfic primer was mismatched to both alleles. 

Conttavows monitoring of a PGR* Using a fiber optic 
devkerK is possible to direct excitation illumination from 
3 spectrofluorometer to a PGR undergoing thermocycling 
and to rctwn its fluorescence w the RpectrofihJow»eter. 
The fluorescence readout of such an arrangement, di- 
rected At an EtBr-containing amplification of Y<hromo- 
t$mc spcettk sequences from 25 r»g of hiim?n male DNA, 
is shown in Figure 5. The readout from a control PCR 
vhh no target DNA is also shown. Thirty cycles of PGR 
were monitored for each. 

The fluorescence trace as a function of time dearly 
shows the effect of the thermocycling, Fhioracence inten- 
sity rises and. fails inversely with temperature. Hie fluo- 
rescence -intensity Is minimum at the denaturation tem- 
perature <&4°C) aad maximum at the anncaUn^exteit^ion 
temperature (50°C). In the negative-control FCR> these 
fluorescence maxima and minima do not change signifi- 
<3>ntfy over the thirty tbcrmOcyde*, indicHting ihat there is 
IKtlc dsDNA synthesis without the appropriate target 
DNA, and there is little if any We^hSiig of EtBr during 
(h< continuous illumination of the sample. 

In the PGR containing male DNA, the fluorescence 
maxima At the annealing/extension temperature begin to 
increase at about 4000 seconds of therroocycling, and 
continue to increase who time, indicating that dsDNA is 
being produced at a detectable leveL Note that the fluo- 
rescence minima -at the denaturation temperature do not 
fligtuftcandy increase* presumably because at this temper- 
ature there is no dfiDN A for EtBr to bind- Thus the course 
of the amplification is followed by tracking the fluorcs-. 
cence increase at the annealing temperature. Analysis of 
the products of th<»e two amplifications by gel elcctropho- 
Jrois showed st DNA fragment of the eiqpcctcd size for the 
male DNA containing sample and no detectable DNA 
synthesis for the control sample. 

DISCUSSION 

Downstream processes such as hybridi7ation to a se- 
queace*pedftc probe can enhance the specificity of DNA 
deceuiuu FCR. T~hc chxnination of thcae processes, 
means that* the specificity of this homogeneous assay 
depends solely on ihat of FCSL In the case of sickle-celi 
disease, we have shown that PGR alone has sufficient DNA 
ftcquencc BpedficiLy to permit genetic screening. Using 
appropriate amplification conditions, there is little non- 
specific production of dsDNA in the absence of the 
appropriate target allele. 

rhe specificity required to detect pathogens can be 
more or less than that required to do genetic screening, 
depending on the number of pathogens in the sample and 
the amount of other DNA that must be taken with the 
sample A difficult target is HIV, Which requires detection 
of a viral genome that can be at the level of a few copies 
per thou/ands of host ccfls 6 . Compared with generic 
screening, which is performed on cells containing at least 
one copy of die target -sequence* HIV Idetection requires 
both more speernaty and the bput of more total 
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WWE 4 UV photography of PCS tubes containing amptificatioiis 
using EtBr th« zx* specific to wiWUtjpe (A) or sicwe (5) alleles of 
the mmuiD f£-globin gene. The left of e^ch pair of tubes contains 
fiSele^pcdfic primers to the wild-type alleles, the right tube 
printers to the steWe futtek- The photograph was fcuWn after SO 
cycles of PCR^and the input DNAs and the alleles ihcv contain 
are indicated. Fifty tog of DNA was used to begin VGiL Typing 
was done in triplicate (3 pairs of PC&) for cadi input DNA 
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20r>g of mote DNA 
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no DNA control 




0 2000 4000 6000 8000 
time (sec) 

fttfK S Continuous, real-time monitoring of a FCR. A fiber optic 
was oscd to carry, excitation fight to a FUR in progress and also 
emitted light back to a flooro meter (see Ex^nftcntal IVotorol). 
Amplification UAiag human malcDNA specific primers in 0 PCR 
starting with 20 ng of human male DNA (top), or in a control 
PGR without DNA (bottom}, were, monitored. Thirty cydes of 
PGR yrere followed for each. The tempcralun: Cycled between 
94°C (denaturatidn) and 50*0 (annealing and extension}. Note in 
the male DNA FCK».the cycle (dmc) dependent increase in 
fluorescence at the' aonealing/extcnaion teapetaturc. 
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I>NA— *ip to microgram amounts— in order to have suf- 
ficient numbers of target sequences. This large amount of 
starting Dr4A iti aft amplication sietiiGomtly increases 
the bacJ^round fluorescence over vrhicU any additional 
fluorescence produced by PGR must be detected. An 
additional complication that occurs with targets in tow 
copy-number is the formation of the ^rimer-dimer'' 
artifact. This is the result of the extension of one primer 
using the other primer as a template. Although this occurs 
infrequently, once it occurs the extension product is a 
substrate for PCR amplification, and can compete with 
true PCR targets if those targets are rare. The primter- 
dimer product i$ of course d$DNA and thus is a potential 
source of false signal in this homogeneous a*$ay, 

To increase PCR specificity and reduce the effect of 
primer* dimcr amplification, we are investigating a num- 
ber of apjproacb.es, including the use of nested-primer 
amplifications that take place in a single tube 8 , and the 
"hot-start", in which nonspeci6c amplification is reduced 
by raising the temperature of the reaction before DNA 
synthesis begins 8 *. Preliminary resuks using these ap- 
proaches suggest thatpirwncrKiirocT is effectively reduced 
and it is possible to detect the increase in Etfir fluores- 
cence in a PCR instigated by a single HIV genome in a 
background of 10* celts. With larger number* of cells, the 
background fluorescence contributed by genomic DNA 
becomes problematic. To, reduce this background, it may 
be possible to use sequence-specific DNA-binding dyes 
that can be made to preferentially bind PCR product over 
gcaomk DNA by incorporating the dye-binding DNA 
sequence into the PCR product through a 5' "add-on" to 
the oliffonucJcotidc primer 2-1 . 

We have shown that the detection of fluorescence 
generated by an EtBr-containing PCR is straightforward, 
both once PCR is completed and continuously during 
Ihcrmocycfing. The ease with which automation of spe- 
cific DNA detection can be accomplished is the most 
protn&ng aspect of this- assay. The fluorescence analysis 
of completed PCRs is alrcadybossiblc with existing instru- 
mentation in 96-well format**. In this format* the fluores- 
cence in each PCR can be cjuantitated before, after, and 
even at selected points during therraocyciirig by moving 
the rack of PCRs to a 9$-microwH plate fluorescence 
readcrf. 

The instrumentation necessary to continuously monitor 
multiple PCRs simultaneously is also simple in principle. 
A direct c* tension of the apparatus used here is to have 
multiple fiberopdes transmit the excitation light and flu- 
orescent emissions to and from multiple PCRs. The ability 
to monitor multiple PCRs continuously may allow quan- 
titation of target DNA copy number. Figure 5 shows that 
the larger the amount of starting target DNA, the sooner 
during PflR a fluorescence increase is detected. Prelimi- 
nary experiments <B iguchi and DoUinger, manuscript in 
preparation) with continuous monitoring have shown a 
$cn$itivicy to two-fold differences in initial target DNA 
concentration. 

Conversely, if the number of target molecules is 
known — as it can be in genetic screening--i»ntimious 
monitoring may provide a means of detecting fabc posi- 
tive and false negative results. With a known number of 
target molecules, a true positive would exhibit detectable 
fluorescence by a predictable number of cycles of PCR. 
Increases in fluorescence detected before or after that 
cycle would indicate potential artrfocu* False negative 
results due to, for example, . inhibition of DNA polymer- 
ase, may be detected by including within each PCR an 
inefficiendy amplifying marker. This marker results in a 
fluorescence increase only after a large number of cy- 
cles — many more than arc necessary to detect a true 



positive. If a sample fails to have a fluorescence increase 
alter this many- cycles, inhibition may be suspected. Since, 
b this assay, conclusions are drawn based on the presence 
or absence of fluorescence signal alone, such controls may 
be important. In any event before any test based on this 
principle is ready for the clinic, an assessment of te false 
posiuWfalse negative rates wfll weed to be obtained' using 
a large number of known samples. 

In summary, the inclusion in PCR of dyes whose fluo- 
rescence is enhanced upon binding dsDNA makes u 
possible to detect spcci&c DNA amplification from outside 
the PCR tube. In the future, instruments based upon this 
principle may facilitate the more widespread use of PCR 
in applications that demand the high tiunoughput of 
samples. 

EXPERIMENTAL PROTOCOL 

Human HLA-DQa gene amplifications ctmtaiiung EtQr. 
PCRs were set up htlOD f*l volumes containing 10 mM Tris-HCL 
pH 8.3; 50 mM KCJ; 4 mM MgO*: 3-$ unit* of taa DNA 
poJymCMSC (PterVhuElmcr Octm, Norwalk, CT); 20 pmolc each 
of human HtA-DQa 1 gene specific oligonucleotide primers 
(iH26 and CH27 19 and approximately W copies of DQfc PCK 
product diluted from a previous reaction. Ethidium bromide 
(fobr, StgtnA) was wed M the concentrations indicated b Figure 
2. Thermocyding proceeded for 20 cycles in a model 4&0 
therraocyder <Perltji>-Elrtier Ccrua, NorWk, CT) uanfr a "step- 
cycle" program of 94*C for 1 mm^ dcnaturalion and 6frC for $0 
sec annealing and 72°C for 30 sec. e&tenstoo, 

Y-chromomnc specific PCR* PCRs ()00 \il total reaction 
volume) cemtaining XL* p£/ral EtBr were prepared as described 
for HLA»DQct, except with different primers and target DNAs. 
These PCRs contained J $ pmolc each male DNA-tpecific primes 
Yl.l andVl,2 M , and cither $0 ng male, 60 dg female, St ng male, 
or do human UNA. Thermocycliti^ £K*CTor 1 min, and 60?C 
for 1 min using a "stcp-cyde" program. The number of cycles for 
a sample were as indicated in Figure 3. Fluorescence measure- 
ment is described below. 

Allek-apccificy human &-£loWn gew* PCR. AropHncauons of 
100 f4 venume vtm£ 0 5 MgAnl of ZtBr were prewired a* 
described for HLA-DQot above except with different pfUnw and 
target DNAs. These PCRs contained eiUier primer pair HGP2/ 
HPHA<wi»<Hype giobin ^peciac primers) or MOP2/R&HS (skk- 
le-^lobin specific primers) at 10 pmole f*Ch primer per fCR, 
These primers were developed by Wu ct aL 2 \ Three different 
tatgei DNAs were tifed ia separate amplifications!— 60 ng eadti of 
human DNA that was homozygous for the xfcUtc trail ($S), DMA 
that was heterozygous for the sickle wak (AS), or DNA that was 
homozygous For the w.t- ^Jobm (AA). Thcrmocycfing wm fVir SO 
cyctes at 94*C for 1 mm. and S5*C for 1 .min. itshvg n * , !^u:p-cYcle , ' 
program. An anoeaKttg tempcratuxe of 55X b^^^cen shown by 
VVa et al, 21 to provide alldc-cpcrifk atPplification- Completed 
PCRs vere phou>gra»hcci uirough a red fitter (Written &3A) 
after pl&dng the rcacucm tube* atop' a model TM-36 transiHitmi- 
nator (UV-productS Saiv Gabriel, CA). 

Ruorescencc weasuremem. Ftuorescei>ce measurement* were 
made oh PCRs containing EtBr in a Fluorolog-2 flUoromCtcT 
(SPEX Edison, NJ). Ewitation was at the 500 ran band with 
•abour 2 nm bandwidth with a GG 435 ntn cut-off filter jMcllcs 
Crist Ibcm Irvine* CA) to exclude secend-order light. Emitted 
light wat detected at 570 nm with a bandwidth of about 7 nm- An 
OG 530 »m cut-off .filter was used to remove the excitation Egtot- 
ContfottOttft rraorescence mentoring of PCR, Condnuou^ 
monitoring or a PCR in progress was accomplished using die 
Bpcctroftuoromeicr and aetringa descrtbed >.bovc as well as a 
fiberoptic accessory (SPEX cat no. 1950) to both send excitation 
fight to, and receive emitted light from, a PCR placed in a well of 
a model £80 ihernvocyctcr (Perkm-Elrtcr Getus). The probe end 
of the Roeroptic cable was attached with "5 nwautc-cpoxy* to tht 
open top of a PCR tube (a 0.5 ml pc^ypropyiene centrifuge tube 
with it* cap removed) effectively scaling il The txposccf top of 
the PCR luoe and the end of the fiberoptic caWe were shielded 
from room light and the room light* were kept dimmed during 
each run. The monitored PCR was an ampltficauon of Y-dbro- 
rnosorno-fipecirk repeat $eqvence$ an described above, except 
ueing.an anncahng/exteniJon tempera lu re of BOX. The reaction 
was covered whii xnii>e«al oil (2 drops) to prevent evaporation. 
Thermocycfing-and- fluorcsocncc roca^urcmcnt were started si' 
multaneously, A tune-base scan witfi a 10 second mtcgrado»' tone 
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cs«a wed and the emtettotk signal was ratioed to tbc excitation 
nigoal to control foe changes in light-iourcc iRtcrurity. jfeta.wcrc 
collected using the draSOOOf, version 15 (SPEX) data system. 
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IMMUNO BIOLOGICAL LABORATORIES 



6CD-14EUSA 

Trauma, Shock and Sepsis 




The CD-14 molecule is' expressed on the surface of 
monocytes and some macrophages. Membrane- 
bound CD-14 is a receptor for lipopolysaccharide 
(LPS) complexed to LPS-8inding-Protein (LBP). The 
concentration of tts soluble form is altered under 
certain pathological conditions. There, is evidence for 
an important role of $CD-14.with polytrauma, sepsis, 
burnings and inflammations. 
During septic conditions and acute infections it seems 
to be a prognostic manner and is therefore of vaJue in 
monftorlng'these patients. 



IBL offers an EUSA for quantitative determination of 

soluble CD-14 in human serum, -piasma, cell-culture 

supematants and other biological fluids. 

Assay features: 12x8 determinations 
(microliter strips), 
precoated with a specific 
monoclonal antibody, 
2x1 hour incubation, 
standard range: 3-96 ng/ml 
detection limit: 1 ng/ml 
CV: intra- and tnteressay < 8% 



For more information caH or fax 



QESELLSCHAFT FUR IMMUNCHEMIE UND - BIOLOGlE^MRH 

OSTERSTRASSE 86 -D- 2000 HAMBURG 20 • GERMANY- TEL. +40/49100 61-64 • FAX + 40 /4011 98 



PAGE 6/6 ' RCVD AT 7/19/2004 3:1 0:03 PM pacific Daylight Time] 1 SVR:SVCS01/0 * DNIS:6638 1 CSID:650 952 988 1 * DURATION (mm-ss):0446 



From (613) 99X-5695 Q^r M "<811209DP04816741 Mon 24 Oct 200^ " 35 PM EDT Page 2 of 7 




WIIIIIResearch 



Oligonucleotides with Fluorescent Dyes at 
Opposite Ends Provide a Quenched Probe 
System Useful for Detecting PCR Product 
and Nucleic Acid Hybridization 

Kenneth J. Livak, Susan J.A. Flood, Jeffrey Marmaro, William Ciusti, and Karin Deetz 

Perkin-EImer, Applied Biosystems Division, Foster City, California 94404 



The 5' nuclease PCR assay detects the 
accumulation of specific PCR product 
by hybridization and cleavage of a 
double-labeled fluorogenk probe 
during the amplification reaction. 
The probe is an oligonucleotide with 
both a reporter fluorescent dye and a 
quencher dye attached. An Increase 
In reporter fluorescence Intensity In- 
dicates that the probe has hybridized 
to the target PCR product and has 
been cleaved by the 5' ->3' nucle- 
olytlc activity of Tag DNA polymerase. 
In this study, probes with the 
quencher dye attached to an internal 
nucleotide were compared with 
probes with the quencher dye at- 
tached to the 3 '-end nucleotide. In all 
cases, the reporter dye was attached 
to the 5' end. All Intact probes 
showed quenching of the reporter 
fluorescence* In general, probes with 
the quencher dye attached to the 3'- 
end nucleotide exhibited a larger sig- 
nal In the 5' nuclease PCR assay than 
the Internally labeled probes. It is 
proposed that the larger signal Is 
caused by increased likelihood of 
cleavage by Taq DNA polymerase 
when the probe Is hybridized to a 
template strand during PCR. Probes 
with the quencher dye attached to 
the 3 -end nucleotide also exhibited 
an Increase in reporter fluorescence 
Intensity when hybridized to a com- 
plementary strand. Thus, oligonucle- 
otides with reporter and quencher 
dyes attached at opposite ends can 
be used as homogeneous hybridiza- 
tion probes, 



homogeneous assay for detecting 
the accumulation of specific PCR prod- 
uct that uses a double-labeled fluoro- 
genic probe was described by Lee et al. (l) 
The assay exploits the 5'-* 3' nucle- 
olyric activity of Taq DNA poly- 
merase^ 35 and is diagramed in Figure l. 
The fluorogenic probe consists of an oli- 
gonucleotide with a reporter fluorescent 
dye, such as a fluorescein, attached to 
the 5' end; and a quencher dye, such as a 
rhodamine, attached internally. When 
the fluorescein is excited by irradiation, 
its fluorescent emission will be 
quenched if the rhodamine is close 
enough to be excited through the pro- 
cess of fluorescence energy transfer 
(FET).< 4 - 5) During PCR, if the probe is hy- 
bridized to a template strand, Taq DNA 
polymerase will cleave the probe be- 
cause of its inherent 5' -* 3' nucleolytic 
activity. If the cleavage occurs between 
the fluorescein and rhodamine dyes, it 
causes an increase in fluorescein fluores- 
cence intensity because the fluorescein 
is no longer quenched. The increase in 
fluorescein fluorescence intensity indi- 
cates that the probe-specific PCR product 
has been generated. Thus, FET between a 
reporter dye and a quencheT dye is criti- 
cal to the performance of the probe in 
the 5' nuclease PCR assay. 

Quenching is completely dependent 
on the physical proximity of the two 
dyes. C6) Because of this, it has been as- 
sumed that the quencher dye must be 
attached near the 5' end. Surprisingly, 
we have found that attaching a rho- 
damine dye at the 3' end of a probe 
still provides adequate quenching for 
the probe to perform in the 5' nuclease 



PCR assay. Furthermore, cleavage of this 
type of probe is not required to achieve 
some reduction in quenching. Oligonu- 
cleotides with a reporter dye on the 5' 
end and a quencher dye on the 3' end 
exhibit a much higher reporter fluores- 
cence when double-stranded as com- 
pared with single-stranded. This should 
make it possible to use this type of dou- 
ble-labeled probe for homogeneous de- 
tection of nucleic acid hybridization. 

MATERIALS AND METHODS 
Oligonucleotides 

Table 1 shows the nucleotide sequence 
of the oligonucleotides used in this 
study. Linker arm nucleotide (LAN) 
phosphoramidite was obtained from 
Glen Research. The standard DNA phos- 
phoramidites, 6-carboxyfiuorescein (6- 
FAM) phosphoramidite, 6-carboxytet- 
ramethyirhodamine succinimidyl ester 
(TAMRA NHS ester), and Phosphalink 
for attaching a 3'-blocking phosphate, 
were obtained from Perkin-Elmer, Ap- 
plied Biosystems Division. Oligonucle- 
otide synthesis was performed using an 
ABI model 394 DNA synthesizer (Applied 
Biosystems). Primer and complement 
oligonucleotides were purified using 
Oligo Purification Cartridges (Applied 
Biosystems). Double-labeled probes were 
synthesized with 6-FAM-labeled phos- 
phoramidite at the 5' end, LAN replacing 
one of the Ts in the sequence, and Phos- 
phalink at the 3' end. Following de- 
protection and ethanol precipitation, 
TAMRA NHS ester was coupled to the 
LAN-containing oligonucleotide in 250 
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FIGURE 1 Diagram of 5' nuclease assay. Stepwise representation of the 5' - 3' ™cleol£c ac- 



mvi Na-bicarbonate buffer (pH 9.0) at 
room temperature. Unreacted dye was 
removed by passage over a PD-10 Sepha- 
dex column. Finally, the double-labeled 
probe was purified by preparative high- 
performance liquid chromatography 
(HPLC) using an Aquapore C 8 220 x 4.6- 
mm column with 7-pm particle size. The 
column was developed with a 24-min 
linear gradient of 8-20% acetonitrile in 
0.1 m TEAA (triethylamine acetate). 
Probes are named by designating the se- 
quence from Table 1 and the position of 
the IAN-TAMRA moiety. For example, 
probe Al-7 has sequence Al with LAN- 
TAMRA at nucleotide position 7 from the 
5' end. 



PCR Systems 

All PGR amplifications were performed 
in the Perkin-Elmer GeneAmp PCR Sys- 
tem 9600 using 50-u-l reactions that con- 
tained 10 mM Tris-HCl (pH 8.3), 50 mM 
KC1, 200 jtM dATP, 200 uw dCTP, 200 
dGTP, 400 m-m dUTP, 0.5 unit of AmpEr- 
ase uracil N-glycosylase (Perkin-Elmer), 
and 1.25 unit of AmpliTaq DNA poly- 
merase (Perkin-Elmer). A 295-bp seg- 
ment from exon 3 of the human p-actin 



gene (nucleotides 2141-2435 in the se- 
quence of Nakajlma-IMima et aL) (7) was 
amplified using primers AFP and ARP 
(Table 1), which are modified slightly 
from those of du Breuil et al. (8> Actin am- 
plification reactions contained 4 mM 
MgC! 2 , 20 ng of human genomic DNA, 
SO nM Al or A3 probe, and 300 hm each 



primer. The thermal regimen was 50°C 
(2 min), 95°C (10 min), 40 cycles of 95°C 
(20 sec) r 60°C (1 min), and hold at 72°C. 
A 515-bp segment was amplified from a 
plasmid that consists of a segment of X 
DNA (nucleotides 32,220-32,747) in- 
serted in the Smal site of vector pUC119. 
These reactions contained 3.5 mM 
MgCl* 1 ng of plasmid DNA, 50 nM P2 or 
P5 probe, 200 nM primer F119, and 200 
nM primer R119; The thermal regimen 
was 50°C (2 min), 95°C (10 min), 25 cy- 
cles of 95°C (20 sec), 57°C (1 min), and 
hold at 72°C 



Fluorescence Detection 

For each amplification reaction, a 40-p,l 
aliquot of a sample was transferred to an 
individual well of a white, 96-well mlcro- 
titer plate (Perkin-Elmer). Fluorescence 
was measured on the Perkin-Elmer Taq- 
Man LS-50B System, which consists of a 
luminescence spectrometer with plate 
reader assembly, a 485-nm excitation fil- 
ter, and a 515-nm emission filter. Excita- 
tion was at 488 nm using a 5-nm slit 
width. Emission was measured at 518 
nm for 6-FAM (the reporter or R value) 
and 582 nm for TAMRA (the quencher or 
d value) using a 10-nm slit width. To 
determine the increase in reporter emis- 
sion that is caused by cleavage of the 
probe during PCR, three normalizations 
are applied to the raw emission data. 
First, emission intensity of a buffer blank 
is subtracted for each wavelength. Sec- 
ond, emission intensity of the reporter is 



TABLE 1 Sequences of Oligonucleotides 



Name 


Type 


F119 


primer 


R119 


primer 


P2 


probe 


P2C 


complement 


P5 


probe 


P5C 


complement 


AFP 


primer 


ARP 


primer 


Al 


probe 


A1C 


complement 


A3 


probe 


A3C 


complement 



Sequence 



ACCCACAGGAACTGATCACCACrC 

ATGTCGCGTTCCGGCTGACGTTCrGC 

TCGCATTACTGATCGTrGCCAACCAGTp 

GTACTGGTTGGCAACGATCAGTAATGCGATG 

CGGA'ITTGCTGGTATCrATGACAAGGATp 

TTCATCCTTGTCATAGATACCAGCAAATCCG 

TCACCCACACTGTGCCCATCTACGA 
CAGCGGAACCGCrCATTGCCAATGG 
ATGCXCTCCCXX^TGCCATCCIGCGTp 
AGACGCAGGATGGCATGGGGGAGGGCATAC 

CGCCCTGGACTTCGAGCAAGAGATp 
CCATCrCTTGCTCGAAGTCCAGCGCGAC _ 



Foi each oligonucleotide used in this study, the nucleic aad sequence is given, wrltter i ni the 
5' 3' direction. There are three types of oligonudeotides: PCR primer, fluorogenlc P«*«*J 
in the s' nuclease assay, and complement used to hybridize to the corresponding probe. For tne 
the ^ a posiuon where LAN with TAMRA attached was substi- 

tuted for a T. (p) The presence of a 3' phosphate on each probe. 
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Ai-2 RaQgccctcccccatgccatcctgcgtp 

A1-7 RATGCCCQCCCCCATGCCATCCTGCGTp 

A1-14 RATGCCCTCCCCCAQGCCATCCTGCGTp 

A1-1 9 IUTGCCCTCCCGCATGCCAQCCTCXXJTp 

A1-22 Ratgccctccxx;catgccatccQgcgtp 

A 1-26 RATGCCCTC(XCCATGCCATCCTGCGQp 



Probe 


518 nm 


582 nm 


RQ- 


RQ+ 


ARQ 




no temp. 


+ temp. 


no temp. 


+ temp. 








A1-2 


25.5 ±2.1 


32.7±1.9 


38.2 + 3.0 


38.2 ±2.0 


0.67 + 0.01 


0.B6±0.06 


0.19 ±0.06 


A1-7 


. 53.5 ±4.3 


395.1 ±21.4 


108.5 ±6.3 


110.3 ±5.3 


0.49 ±0.03 


3.58 ±0.17 


3.09 ±0.1 8 


A1-14 


127.0 ±4.0 


403.5 ±19.1 


109.7 ±5.3 


93.1 ±6.3 


1.16 ±0.02 


4.34 ±0.15 


3.18±0.15 


A1-19 


187.5 ±17.9 


422.7 ±7.7 


70.3 ±7.4 


73.0 ±2.8 


267 ±0.05 


5.80 ±0.15 


3.13 + 0.16 


A 1-22 


224.6 ±9.4 


482.2 ±43.6 


100.0 + 4.0 


96.2 ±9.6 


2.25 ±0.03 


5.02±0.11 


2.77 ±0.12 


A1-26 


160.2 ±8.9 


454.1 ±18.4 


93.1 ±5.4 


90.7 ±3.2 


1.72 ±0.02 


5.01±0.0B 


3.29 ±0.08 



FIGURE 2 Results of 5' nuclease assay comparing 0-actin probes with TAMRA at different nucle- 
otide positions. As described in Materials and Methods, PCR amplifications containing the in- 
dicated probes were performed i and the fluorescence emission was measured at 518 and 582 nm. 
Reported values are the average±l s.D. for six reactions run without added template (no temp.) 
and six reactions run with template (-Kemp.). The RQ ratio was calculated for each individual 
reaction and averaged to give the reported RQ" and RQ + values. 



divided by the emission intensity of the 
quencher to give an RQ ratio for each 
reaction tube. This normalizes for well- 
to-well variations in probe concentra- 
tion and fluorescence measurement. Fi- 
nally, ARQ is calculated by subtracting 
the RQ value of the no-template control 
(RQ~) from the RQ value for the com- 
plete reaction including template 
(RCT)< 

RESULTS 

A series of probes with increasing dis- 
tances between the fluorescein reporter 
and rhodamine quencher were tested to 
investigate the minimum and maximum 
spacing that would give an acceptable 
performance in the 5' nuclease PCR as- 
say. These probes hybridize to a target 



sequence in the human p-actin gene. 
Figure 2 shows the results of an experi- 
ment in which these probes were in- 
cluded in PCR that amplified a segment 
of the 0-actin gene containing the target 
sequence. Performance in the S' nu- 
clease PCR assay is monitored by the 
magnitude of ARQ, which is a measure 
of the increase in reporter fluorescence 
caused by PCR amplification of the 
probe target. Probe Al-2 has a ARQ value 
that Is close to zero, indicating that the 
probe was not cleaved appreciably dur- 
ing the amplification reaction. This sug- 
gests that with the quencher dye on the 
second nucleotide from the 5' end, there 
is insufficient room for Taq polymerase 
to cleave efficiently between the reporter 
and quencher. The other five probes ex- 
hibited comparable ARQ values that are 
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clearly different from zero. Thus, all five 
probes are being cleaved during PCR am- 
plification resulting in a similar increase 
in reporter fluorescence. It should be 
noted that complete digestion of a probe 
produces a much larger increase in re- 
porter fluorescence than that observed 
in Figure 2 (data not shown). Thus, even 
in reactions where amplification occurs, 
the majority of probe molecules remain 
uncleaved. It is mainly for this reason 
that the fluorescence intensity of the 
quencher dye TAMRA changes little with 
amplification of the target This is what 
allows us to use the 582-nm fluorescence 
reading as a normalization factor. 

The magnitude of RQ" depends 
mainly on the quenching efficiency in- 
herent in the specific structure of the 
probe and the purity of the oligonucle- 
otide. Thus, the larger RQ" values indi- 
cate that probes Al-14, Al-19, Al-22, and 
Al-26 probably have reduced quenching 
as compared with Al-7. Still, the degree 
of quenching is sufficient to detect a 
highly significant increase in reporter 
fluorescence when each of these probes 
is cleaved during PCR. 

To further investigate the ability of 
TAMRA on the 3' end to quench 6-FAM 
on the 5' end, three additional pairs of 
probes were tested in the 5 ' nuclease 
PCR assay. For each pair, one probe has 
TAMRA attached to an internal nucle- 
otide and the other has TAMRA attached 
to the 3' end nucleotide. The results are 
shown in Table 2. For all three sets, the 
probe with the 3' quencher exhibits a 
ARQ value that is considerably higher 
than for the probe with the internal 
quencher. The RQ" values suggest that 
differences in quenching are not as great 
as those observed with some of the Al 
probes. These results demonstrate that a 
quencher dye on the 3' end of an oligo- 
nucleotide can quench efficiently the 



TABLE 2 Results of 5' Nuclease Assay Comparing Probes with TAMRA Attached to an Internal or 3'-terminal Nucleotide 
518 nm 582 nm 



Probe 


no temp. 


+ temp. 


no temp. 


+ temp. 


RQ" 


RQ + 


ARQ 


A3-6 
A3-24 


54.6 ± 3.2 
. 72.1 ± 2.9 


84.8 ±3.7 
236.5 ± 11,1 


116.2 ± 6.4 
84,2 ± 4.0 


115.6 ±2.5 
90.2 ± 3.8 


0.47 ± 0.02 
0.86 ± 0.02 


0.73 ± 0.03 
2.62 ± 0.OS 


0.26 ± 0.04 
1.76 ± 0.05 


P2-7 
P2-27 


82.8 ± 4.4 
113.4 ±6.6 


384.0 ± 34.1 
555.4 ± 14.1 


105.1 ± 6.4 
140.7 ± 8.5 


120.4 ± 10.2 
118.7 ±4.8 


0.79 ± 0.02 
0.81 ± 0.01 


3.19 ±0.16 
4.68 ± 0.10 


2.40 ±0.16 
3.88 ± 0.10 


P5-10 
P5-28 


77.5 ± 6.5 
64.0 ± S.2 


244.4 ± 15.9 
333.6 ±12.1 


86.7 2 4.3 
100.6 ± 6.1 


95.8 ± 6.7 
94.7 ± 6.3 


0.89 ± 0.05 
0.63 ± 0.02 


2.55 ± 0.06 
3,53 * 0.12 


1.66 ± 0.08 
2.89 ± 0.13 



Reactions containing the indicated probes and calculations were performed as described in Material and Methods and in the legend to Fig. 2. 
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fluorescence of a reporter dye on the 5' 
end. The degree of quenching is suffi- 
cient for this type of oligonucleotide to 
be used as a probe in the 5' nuclease PCR 
assay. 

To test the hypothesis that quenching 
by a 3' TAMRA depends on the flexibility 
of the oligonucleotide, fluorescence was 
measured for probes in the single- 
stranded and double-stranded states. Ta- 
ble 3 reports the fluorescence observed 
at 518 and 582 nm, The relative degree 
of quenching is assessed by calculating 
the RQ ratio. For probes with TAMRA 
6-10 nucleotides from the 5' end, there 
is little difference in the RQ values when 
comparing single-stranded with double- 
stranded oligonucleotides. The results 
for probes with TAMRA at the 3' end are 
much different. For these probes, hy- 
bridization to a complementary strand 
causes a dramatic increase in RQ. We 
propose that this loss of quenching is 
caused by the rigid structure of double- 
stranded DNA, which prevents the 5' 
and 3' ends from being in proximity. 

When TAMRA is placed toward the 3' 
end, there is a marked Mg 2 * effect on 
quenching. Figure 3 shows a plot of ob- 
served RQ values for the Al series of 
probes as a function of Mg 2 " 1 " concentra- 
tion. With TAMRA attached near the 5' 
end (probe Al-2or Al-7), the RQ value at 
O mM Nig 2 * is only slightly higher than 
RQ at 10 mM Mg 2 *. For probes Al-19, 
Al-22, and Al-26, the RQ values at 0 urn 
Ivfg 2 * are very high, indicating a much 



reduced quenching efficiency. For each 
of these probes, there is a marked de- 
crease in RQ at 1 mM Mg 2 * 1 " followed by 
a gradual decline as the Mg 2+ concen- 
tration Increases to 10 mM. Probe Al-14 
shows an intermediate RQ value at 0 mM 
Mg 2 * with a gradual decline at higher 
Mg 2 * concentrations. In a low-salt en- 
vironment with no Mg 2+ present a sin- 
gle-stranded oligonucleotide would be 
expected to adopt an extended confor- 
mation because of electrostatic repul- 
sion. The binding of Mg 2 * ions acts to 
shield the negative charge of the phos- 
phate backbone so that the oligonucle- 
otide can adopt conformations where 
the 3 r end is close to the 5' end. There- 
fore, the observed Mg 2 *** effects support 
the notion that quenching of a 5' re- 
porter dye by TAMRA at or near the 3' 
end depends on the flexibility of the oli- 
gonucleotide. 

DISCUSSION 

The striking finding of this study is that 
it seems the rhodamine dye TAMRA, 
placed at any position in an oligonucle- 
otide, can quench the fluorescent emis- 
sion of a fluorescein (6-FAM) placed at 
the 5' end. This implies that a single- 
stranded, double-labeled oligonucle- 
otide must be able to adopt conforma- 
tions where the TAMRA is close to the 5 r 
end. It should be noted that the decay of 
6-FAM in the excited state requires a cer- 
tain amount of time. Therefore, what 



matters for quenching is not the average 
distance between 6-FAM and TAMRA 
but, rather, how close TAMRA can get to 
6-FAM during the lifetime of the 6-FAM 
excited state. As long as the decay time of 
the excited state Is relatively long com- 
pared with the molecular motions of the 
oligonucleotide, quenching can occur. 
Thus, we propose that TAMRA at the 3 ; 
end, or any other position, can quench 
6-FAM at the 5' end because TAMRA is in 
proximity to 6-FAM often enough to be 
able to accept energy transfer from an 
excited 6-FAM. 

Details of the fluorescence measure- 
ments remain puzzling. For example, Ta- 
ble 3 shows that hybridization of probes 
Al-26, A3-24, and P5-28 to their comple- 
mentary strands not only causes a large 
increase in 6-FAM fluorescence at 518 
nm but also causes a modest increase in 
TAMRA fluorescence at 582 nm, If 
TAMRA is being excited by energy trans- 
fer from quenched 6-FAM, then loss of 
quenching attributable to hybridization 
should cause a decrease in the fluores- 
cence emission of TAMRA. The fact that 
the fluorescence emission of TAMRA in- 
creases indicates that the situation is 
more complex. For example, we have an- 
ecdotal evidence that the bases of the 
oligonucleotide, especially G, quench 
the fluorescence of both 6-FAM and 
TAMRA to some degree. When double- 
stranded, base-pairing may reduce the 
ability of the bases to quench. The pri- 
mary factor causing the quenching of 
6-FAM in an intact probe is the TAMRA 
dye. Evidence for the importance of 
TAMRA is that 6-FAM fluorescence 
remains relatively unchanged when 
probes labeled only with 6-FAM are used 
in the 5' nuclease PCR assay (data not 
shown). Secondary effectors of fluores- 
cence, both before and after cleavage of 
the probe, need to be explored further. 

Regardless of the physical mecha- 
nism, the relative independence of posi- 
tion and quenching greatly simplifies 
the design of probes for the 5' nuclease 
PCR assay. There are three main factors 
that determine the performance of a 
double-labeled fluorescent probe in the 
5' nuclease PCR assay. The first factor is 
the degree of quenching observed in the 
intact probe. This is characterized by the 
value of RQ", which is the ratio of re- 
porter to quencher fluorescent emis- 
sions for a no template control PCR. In- 
fluences on the value of RQ~ include 
the particular reporter and quencher 



TABLE 3 Comparison of Fluorescence Emissions of Single-stranded and 
Double-stranded Fluorogenic Probes 



518 nm 582 nm RQ 



Probe 


ss 


ds 


ss 


ds 


ss 


ds 


Al-7 


27.75 


68.53 


61.08 


138.18 


0.45 


0.50 


Al-26 


43.31 ' 


509 .38 


53.50 


93.86 


0.81 


5.43 


A3-6 


16.75 


6Z.88 


39.33 


165.57 


0.43 


0.38 


A3-24 


30.05 


578.64 


67.72 


140.2S 


0.45 


3.21 


P2-7 


35.02 


70.13 


S4.63 


121.09 


0.64 


0.58 


P2-27 


39.89 


320.47 


6S.10 


61.13 


0.61 


5.25 


P5-10 


27.34 . 


144.85 


61.95 


165.54 


0.44 


0.87 


P5-28 


33.65 


462.29 


72.39 


104.61 


0.46 


4.43 



(ss) Single-stranded. The fluorescence emissions at 518 oi 582 nm for solutions containing a final 
concentration of 50 nw indicated probe, 10 mM Tris-HCl (pH 83), 50 mM KC1, and 10 mM MgCl 2 . 
(ds) Double-stranded. The solutions contained, in addition, 100 nM A1C for probes Al-7 and 
Al-26, 100 nM A3C for probes A3-6 and A3-24, 100 nvi P2C for probes P2-7 and P2-27, or 100 nM 
P5C for probes P5-10 and P5-28. Before the addition of MgCl* 120 ui of each sample was heated 
at 95°C fox 5 roin. Following the addition of 80 yJ of 25 mM MgCl^ each sample was allowed to 
cool to room temperature and the fluorescence emissions were measured. Reported values are 
the average of three determinations. 
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mM Mg 

FIGURE 3 Effect of Mg 2 * concentration on RQ ratio for the Al series of probes. The fluorescence 
emission intensity at 518 and 582 am was measured for solutions containing 50 nM probe, 10 mM 
Trls-HCl (pH 8.3), 50 mM KCl, and varying amounts (O-10 mM) of Mgd 2 . The calculated RQ 
ratios (518 nm intensity divided by 582 run intensity) are plotted vs. MgCI 2 concentration (mM 
Mg). The key (upper right) shorn the probes examined. 



dyes used, spacing between reporter and 
quencher dyes, nucleotide sequence 
context effects, presence of structure or 
other factors that reduce flexibility of 
the oligonucleotide, and purity of the 
probe. The second factor is the efficiency 
of hybridization, which depends on 
probe T m , presence of secondary struc- 
ture in probe or template, annealing 
temperature, and other reaction condi- 
tions. The third factor is the efficiency at 
which Taq DNA polymerase cleaves the 
bound probe between the reporter and 
quencher dyes. This cleavage is depen- 
dent on sequence complementarity be- 
tween probe and template as shown by 
the observation that mismatches in the 
segment between reporter arid quencher 
dyes drastically reduce the cleavage of 
probe. (1) 

The rise in RQ" values for the Al se- 
ries of probes seems to indicate that the 
degree of quenching is reduced some- 
what as the quencher is placed toward 
the 3' end. The lowest apparent quench- 
ing is observed for probe Al-19 (see Fig. 
3) rather than for the probe where the 
TAMRA is at the 3' end (Al-26). This is 
understandable, as the conformation of 
the 3' end position would be expected to 
be less restricted than the conformation 
of an internal position. In effect, a 
quencher at the 3' end is freer to adopt 
conformations close to the 5' reporter 
dye than is an internally placed 
quencher. For the other three sets of 



probes, the interpretation of RQ" values 
is less clear-cut. The A3 probes show the 
same trend as Al, with the 3' TAMRA 
probe having a larger RQ~ than the in- 
ternal TAMRA probe. For the P2 pair, 
both probes have about the same RQ~ 
value. For the P5 probes, the RQ" for the 
3' probe is less than for the internally 
labeled probe. Another factor that may 
explain some of the observed variation is 
that purity affects the RQ~ value. Al- 
though all probes are HPLC purified, a 
small amount of contamination with 
unquenched reporter can have a large ef- 
fect on RQ", 

Although there may be a modest ef- 
fect on degree of quenching, the posi- 
tion of the quencher apparently can 
have a large effect on the efficiency of 
probe cleavage. The most drastic effect is 
observed with probe Al-2, where place- 
ment of the TAMRA on the second nu- 
cleotide reduces the efficiency of cleav- 
age to almost zero. For the A3, P2, and P5 
probes, ARQ is much greater for the 3' 
TAMRA probes as compared with the in- 
ternal TAMRA probes. This is explained 
most easily by assuming that probes 
with TAMRA at the 3' end are more likely 
to be cleaved between reporter and 
quencher than are probes with TAMRA 
attached internally. For the Al probes, 
the cleavage efficiency of probe Al-7 
must already be quite high, as ARQ does 
not Increase when the quencher is 
placed closer to the 3' end. This illus- 



trates the importance of being able to 
use probes with a quencher on the 3' 
end in the 5' nuclease PCR assay. In this 
assay, an increase in the intensity of re- 
porter fluorescence is observed only 
when the probe is cleaved between the 
reporter and quencher dyes. By placing 
the reporter and quencher dyes on the 
opposite ends of an oligonucleotide 
probe, any cleavage that occurs will be 
detected. When the quencher is attached 
to an internal nucleotide, sometimes the 
probe works well (Al-7) and other times 
not so well (A3-6). The relatively poor 
performance of probe A3-6 presumably 
means the probe is being cleaved 3' to 
the quencher rather than between the 
reporter and quencher. Therefore, the 
best chance of having a probe that reli- 
ably detects accumulation of PCR prod- 
uct in the 5' nuclease PCR assay is to use 
a probe with the reporter and quencher 
dyes on opposite ends. 

Placing the quencher dye on the 3' 
end may also provide a slight benefit in 
terms of hybridization efficiency. The 
presence of a quencher attached to an 
internal nucleotide might be expected to 
disrupt base-pairing and reduce the T m 
of a probe. In fact, a 2°C-3°C reduction 
in T m has been observed for two probes 
with internally attached TAMRAs. {9) This 
disruptive effect would be minimized by 
placing the quencher at the 3' end. Thus, 
probes with 3' quenchers might exhibit 
slightly higher hybridization efficiencies 
than probes with internal quenchers. 

The combination of Increased cleav- 
age and hybridization efficiencies means 
that probes with 3' quenchers probably 
will be more tolerant of mismatches be- 
tween probe and target as compared 
with internally labeled probes. This, tol- 
erance of mismatches can be advanta- 
geous, as when trying to use a single 
probe to detect PCR-ampiified products 
from samples of different species. Also, it 
means that cleavage of probe during PCR 
is less sensitive to alterations in an- 
nealing temperature or other reaction 
conditions. The one application where 
tolerance of mismatches may be a disad- 
vantage is for allelic discrimination. Lee 
et al. (1> demonstrated that alleie-spednc 
probes were cleaved between reporter 
and quencher only when hybridized to a 
perfectly complementary target This al- 
lowed them to distinguish the normal 
human cystic fibrosis allele from the 
AFS08 mutant. Their probes had TAMRA 
attached to the seventh nucleotide from 
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the 5' end and were designed so that any j 
mismatches were between the reporter 
and quencher. Increasing the distance 
between reporter and quencher would 
lessen the disruptive effect of mis- 
matches and allow cleavage of the probe 
on the incorrect target, thus, probes 
with a quencher attached to an internal 
nucleotide may still be useful for allelic 
discrimination. 

In this study loss of quenching upon 
hybridization was used to show that 
quenching by a 3' TAMRA is dependent 
on the flexibility of a single-stranded oli- 
gonucleotide. The increase in reporter 
fluorescence intensity, though, could 
also be used to determine whether hy- 
bridization has occurred or not. Thus, 
oligonucleotides with reporter and 
quencher dyes attached at opposite ends 
should also be useful as hybridization 
probes. The ability to detect hybridiza- 
tion in real time means that these probes 
could be used to measure hybridization 
kinetics. Also, this type of probe could be 
used to develop homogeneous hybrid- 
ization assays for diagnostics or other ap- 
plications. Bagwell et al. (,0) describe just 
this type of homogeneous assay where 
hybridization of a probe causes an in- 
crease in fluorescence caused by a loss of 
quenching. However, they utilized a 
complex probe design that requires add- 
ing nucleotides to both ends of the 
probe sequence to form two imperfect 
hairpins. The results presented here 
demonstrate that the simple addition of 
a reporter dye to one end of an oligonu- 
cleotide and a quencher dye to the other 
end generates a fluorogenic probe that 
can detect hybridization or PCR amplifi- 
cation. 
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Wc have developed a novel "real time" quantitative PCR method. The method measure* ICR product 
acoimuucfon through a duaWabeltd Ihioroeenlc probe (U, TaqMan Probe). This method provides very 
accurate And reproducible quantitation oF 8«nc copies. Unlike -other quantitative PCR methods, real-time PCR 
does nor require posi-PCR sample handling, preventing potential PCR product carry-over contamination and 
resulting in much faster and higher throughput assays. The real-time PCR method has a very large dynamic 
ranee of starting target molecule determination (at least five orders of magnitude). Real-time duantitartvc 
PCrt is extremely accurate and less labor-intensive than current quantitative PCR methods. 



Quantitative nucleic acid sequence analysis lias 
had an important role in many fields of biologi- 
cal research. Measurement of gene expression 
(RMA) has been used extensively In monitoring 
biological responses to various stimuli flan et al> 
1994; Huang ci ai. 1995a,h; Prud'homme ct at. 
1995). Quantitative gene analysis (DNA) has 
Ix-cn used to determine the gemuuu quantity of a 
particular gene, as in the case at the human H1LK2 
gene, which Is amplified in -30% of breast tu- 
mors (Slamon et at. 1987). CJcnc and genome 
quantitation (DNA and UNA) also have been used 
for analysis of human immunodeficiency virus 
(JiJV) buTden demonstrating changes in the lev- 
els of virus throughout the different phases of the 
disease (Connor et al. 1993; PJiitak el al. iw:th; 
Pintado et al. 1995). 

Many methods have been described for The 
quantitative analysis ot nucleic acid sequences 
(both for RNA and DNA; Southern VJfb; Sharp el 
al. 1980; Thomas 19«0). Recently, PCR lias 
proven to be a powerful tool for quantitative 
nucleic acid analysis. PCR and reverse transcrip- 
tase: (R'O-PCR have permitted the analysis of 
minimal starting quantities of nucleic acid (as 
little as one cell equivalent). This has made pos- 
sible many experiments that could not hove been 
performed with traditional methods. Although 
PCR bas provided a powerful tool, it is imperative 



thai \\ be uaed properly for quantitation (TUwy- 
maefcro 1995). Many early reports of quantita- 
tive PCK and RT-PCR described quantitation of 
the PCR product hut did not measure the initial 
target sequence quantity. It is essential to design 
proper controls for the quantitation of the initial 
target .sequences (1'errc 1992; Clcmc-ntl ct at. 
100?.) 

KeNMtrchcn have developed several methods 
of quantitative PCR and RT-PCR, One approach 
measures PCR product quantity in the l«g phase 
of the reunion before the plateau (Kellogg el al. 
1990; l^ng ct ah 1990). This method requires 
that each sample has equal Input amounts of 

■ nucleic- add and that each sornnlc under analysis 
amplifies with identical efficiency up to the. point 
of quantitative analysis. A gene sequence (mn- 
tained in alt .samples at relatively constant quan- 
tities, such as p-aclln) cum be used For sample; 
amplification efficiency normalization. Using 
conventional methods of PCR detection and 
quantitation (gel electrophoresis or plate capture 
hybridization), it is extremely laborious to assure 
that all samples tire analyzed during the log phase 
of the reaction (for bolh the target gene and the 
normalization gene), Another method, quantita- 
tive competitive (QQ-PCR, has been developed 
and is used widely for PCR quantitation. QC-PCR 
relics on the inclusion of an internal control 
competitor in each reaction (Becker-Andre 1991; 
Hatak et til. I993a,b). The efficiency of each re- 
action is normalised to the internal competitor. 
a wnnwn a itu u in I o( lmemaJ competitor can be 

antoM rncfl no/ «*« wa rc:&t 7nn7/cn/7T 
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oddod to each sample. To obtain relative quant- 
tatlon, the unknown target PGR product is com- 
pared with the known 'competitor K'.U product. 
Success of si quantitative competitive I'CR assay 
relics on Cicvcloplng an Internal control ilinl am- 
jriinra vvtth the same efficiency e.s the Uugci mol- 
cculc. The design of the coinpedtot and the vn)l- 
cation of amplification efficicncicr* ic quire a 
dedicated effort. However, becnusc QCMTR docs 
not require that PCitt pioducls be analy?Hd during 
the lot; phase of Hie. amplification, it is tin*, easier 
vf the two methods to use. 

Several detection system* iiie use.d for quan 
Utative l'CK and RT-PCll analysis; (1) a^rose 
gels, (2) fluorescent labeling of T*OU products and 
detection with InaiTT'induccd fluorescence -using 
capillary electrophoresis (h«sco ct ah 1995; Wil- 
liams et ;il. 1 996) or acryiaiuldc gels, and (.3) white 
capture and sandwich probe hybridization (Mul- 
der et al. 1994). Although these methods proved 
successful, each method requires posl-l'CR my- 
alpulatlons that add Time to the analysis and 
may lead lu hibiuatuty <. iiiilnuiination. The 
sample throughput of these jnrtlu>d>. i.s limited 
(wllb the t-xccpilon of the plate capture ap- 
proach), wiwl, lhwn:font, these methods, ore not 
well suited fwi um.'.> dvn landing high sample 
throughput (I.e., screening of large numbers of 

blomwtc^LiI<rs oi Jimlyy.ln^ SAmplva fui diagnua- 
Ucs or clinical trials). 

I lore wv report the development of » novel 
ii.isay for quantitative H)NA analysis. The assay is 
Iwsgd on <he use*. -of the .S' ' nuclease assay first 
described by Hollund et al. (1991). TJie method 
u5es the 5' nuclease aciivliy of 7Vj</ polymerase to 
cleave a non extendible hyhridl/aiion probe dur- 
ing rhr extension phnsr of T'OU- Th« approach 
uses dual-labeled fluorogenic hybridisation 
probt-s (Lcc et id. 1 S>5>3; Mossier ct al. 19^3; 1 Jvak 
ct al, l$9oo,b). One fluorescent dye serves as a 
reporter [FAM (i.e., (^carbo.xynuorcavcm)| nnd its 
emission spectra is quenched by the second fluo- 
rescent dye, TAMRA (I.e., rt-carboxy-ietramethyl- 
rhodaminc). The nuclease degradation of the hy- 
brtdi/.atton probe releases the quenching of the 
I'AM fluorescent emission, resulting in an In- 
crease in peak fluorescent emission at SJg nm. 
The use Of a sequence detector (Abl Prism) allows 
measujcment of fluorescent spectra of all 96 wells 
of the thermal cycler continuously during the 
J'CR nrnphftcarlon. Therefore, the reucliuna ajc 
monitored in real lime. The owtpui data is de- 
scribed and quantitative analysis of input target 
I )NA sequences L5 discussed below. 



REAL HML 0UANIIU1IV1 PCK 

RESULTS 

PGR Product Dercctlon in R«al Time 

The goal was to develop a high-throughput, sen- 
sitive, and nccurato gene quantitation assay for 
use In monitoring lipid mediated therapeutic 
gene delivery. A plasm id encoding human factor 
VIII gene sequence, pK8TM (sec Methods), was 
used as a model ihorapeutic gene. The assay use* 
fluorescent Taqman methodology and an instru- 
ment capable of measuring fluorescence in real 
time (AM Prism 7700 Sequence r>ctrrlnr). Hu: 
Taqman reaction requires » hybrldl2atto7> jwnhr* 
lal>cled with two different fluorescent dyes. One 
dye Is a reporter dyw {I«'AM), the other is x t^uench- 
ing dye (TAMRA). When the prol**: \s intact, fluo- 
test en I energy transfer occurs and the reporter 
dye fluorcsconf emission 19 nbsorbc.d by the 
quenching dye (TAMRA). During the ex tension 
phase of the l'CR cycle, the fluorescent hybrid- 
l/iil Inn probe cleaved by the S'-.'V nuclcolytic 
actlvity of thr DNA polymerase. On cleavage of 
the probe, the reporter dye emission is no longer 
transferred efficiently to the inicnching dye, re 
suiting hi j ii in urease of the report or clyo fluores- 
ce. n1 ciiii.f.tluii djXrCtro. PCR primers unci probun 
were de»ignttd ft** I lie human factor VJ1J se- 
quence and human p-actln gene (a.t dcutcnlxrd in 
Methods). Optimization reactions were per- 
formed to choose the approprluto probe and 
-magnesium concentrations yielding the hiKhe^i 
Intensity "of reporter fluorescent signal without 
suerlfJciJig specificity. The Inst rumen I uses a 
charKi:-couplr.d device (I.e., CCD eaineru) for 
measuring the fluorescent emission apeetni /rom 
, r jOO n> C$0 nm. Mach TC:n tube was monitored 
sequentially for 25 msec with continuous moni- 
torinjE; thnni^hout th« anii>Hfienlitm. Uftcli tube 
was rr.-cxan dried every B*5 sec. Computer x>fl- 
wttre. was designed U> examine the fluorescent In* 
tensity of both the reporter dye (PAM).and 
the quenching dye (TAMRA). The fluorescent 
intensity of the quenching dye, TAMUA, chajiges 
very Utile over the course of the PCJ* amplifi- 
cation (data not shown). Therefore, the Intensity 
of TAM11A dye emission serves hx mm hrtcrnal 
.%loiidard wltli which lo nornmll/e the reporter 
dyi: (FAM) cm I AA I on variatjous. 7*he AoftWnre etil- 
eulalc» u value termed ARn (or kkQ) uslny; the 
following equation: ARn - (Iln J ) (R»'), where 
Kn 4 cTnl!f«ion intcjishy u( reporter/emission in- 
tensity uf quencher at any given lime In a rone, 
don tube, and Ru remission intensitity of re- 
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porter/ o.misslOT) hitemity »f qucnciurr measured 
prif >r 10 TCK iunplilication in Hint same reaction 
tul>c. l ; or the purpose of quantitation, the Um 
three data points (AKm) collected iluring the ex* 
tension step for each 1 J CH< cycle were annly/.ed. 
Thenucleolytic degradation of the hyuricli/attion. 
probe occurs during the cxtciisjun phase of I'Ut, 
and, thiereforo, reporter fluorescent cnimjun in- 
creases during this time. 'Jin: iluw data points 
were averaged for each KJk cycle and the mean 
value for each was plotted in an "amplification 
plat" shown In J'tS ure iA - Thv ^Kn mean vahu* is 
l?U>Ued on the y-axis, and time, represented by 
cycle number, is plotted on tljv.x-&xi£. During the 
early cycles of the PCU amplification, tin- ARn 



value remains at base line When sufficient hy- 
bridization probe has been cleaved by the Tun 
jx^ymcrase nucleate activity, the intensity of re. 
porter fluorescent emission increases.. MoM 1>0U 
ainptifiuMlons reach » plateau phoMo of roporter 
fluorescein emission if the rcuuliun Is carried nut 
to high cycle numlwis. The a/npli fira lion plot 19 
examined vaiiy in lh« reaction, ut a point thai 
n-prcsonts the lofl phase of product armmula> 
lion. This is done by ussifinlng an aibiUiuy 
ihr'cshoJd ttui is twscd on the variability of the 
base-line did*. In Figure 1 A, the threshold was set 
ui in standard devicilloiiN above the mean of 
Vj&ao line emission calculated from lyden 1 lo 1 fv 
Once the threshold is chosen, the point at which 
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Figure 1 PCR product detection in real time. {A) The. Model 7700 ^uUware will construct amplification plots 
from the extension phase fluorescent emission data collected during the PCR amplification. The standard de- 
viation is determined from the data points collected from the base line of the amplification ploL C, values are 
calculated by determining the poini at which the fluorescence exceeds a threshold limit (usually 10 times the 
Standard deviation of the base line). (B) Overlay ot amplification plots of serially (1:2) diluted human genomic 
DNA samples amplified with p-actin primers. (C) Input DNA concentration of the samples plotted versus C T . All 
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the amplification plot croseco the thrcriholdivcle 
fined as C P C r is reported ua the cycle number a\ 
this point. A* will be demons! rutud; tin* C\ valor 
ij* piediciivt of the quantity of input ti*rgt*.t. 

Cf Values Provide a Quantitative Measurement* o>* 
Inpur Targer Sequences 

Figure IB shows amplification pioU of Jii'ClirYr*-. 

ejil TCR amplifications overlaid. The ampiificu- 
Hons were performed on a 1:2 serial dlhttoft'iti 
human genomic IWA. The amplified targef vww. 
human p octin. The amplification plots drift to 
tJw* right (to higher threshold cycles) ns the input 
target quantity is reduced, 'JTvic \$ expected ho- 
vuxxm wwcHons with f«w«r starting copinx of the 
target molecule require, greater amplification to 
degrade enough probe to attain the Threshold 
fluorescence. An arbitrary threshold of J0"stan- 
dard deviation* above the base line was used to 
determine the G r values. Figure 1C represents the 
C^. value* plotted versus the sample dilution 
value, Each dilution was amplified in triplicate 
Pc:R amplifications and plotted as mean values 
with error bars representing one standard devia- 
rton. The C T values decrease linearly with increas- 
ing target quantity, Thus, C r valuta can be used 
as » quantitative measurement of tin* input target 
number, it should be noted that the amplifica- 
tion plot for the 15.6-ng sample shown In 1'lgurc 
IB does not reflect the same fluorescent rate of 
Increase exhibited by most of the other samples. 
The 15.6-ng sample also aehie.ve.s- endpoint pla- 
teau at a lower fluorescent value than would he 
expected based on the input PNA. This phenom. 
cnon has been observed, occasionally with other 
samples (data not shown) and may be attribut- 
able to. lute cycle inhibition; this hypothesis is 
siil] under investigation. It is important to note 
that the flattened .slope and early plateau do not 
impact significantly the calculated C, value us 
demonstrated by the fh on the line .shown in 
Figure l C, All triplicate amplifies* ions resulted in 
very similar Cr values— the standard deviation 
did not exceed O.sS for any dlJullon. This experi- 
ment contains a > 1 00,000-fold range of Input tar- 
get molecules. Using C v values for quantitation 
permits a much larger assay range than dlrcclJy 
using total fluorescent emission intensity for 
quantitation. The linear range. oi fluorescent in- 
tensity measurement of the AUI Trlam 7700 Se- 

antnj 
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rnoiits over n very large r;iii|»c of rMalivp start li 
UirfcjM quantities. 

Sample Preparation Validation 

Several parameters influence the efhcU : «ry 
PCM amplification: magnesium and sail conce 
tratloii*. reaction conditions (i.e., time wul tin 
poralure), PCU target size and eompositio 
primer sequences, and sample purity. All of tl 
above factors are common to a single VC\{ assa 
except sample to sample purity, in an effort 
validate the method of sample preparation f 
the factor VUl assay, PCR amplification reprodu 
fbih'ty and oil'Iclency ol 10 replicate samp 
pri'lwratioTiR ware examined. After genomic Dh 
was prepared from the 10 replicate samples, ll 
DNA was qunntUatcd by ultraviolet spcciroscop 
Amplifications were performed analyzing p-acti 
>>cth: content til 100 and 25 »t>* of total xunoni 
IWA. Each VCR amplification was performed i 
triplicate. Comparison of C,. values for each tri] 
licate sample show minimal variation based c 
standard deviation and coefficient of variant 
(Tabic 1). Therefore, each ol the triplicate PC. 
amplifications was highly reproducible, dc TMOT 
Slrailng that real time PCli using this tnstrumcr 
tnlion introduces minimal variation Into th 
quantitative PGR analysis. Omiparlsiou of tti 
mean C n values of the 10 replicate sample prep, 
rations also showed minimal variability, indica 
ing that each sample preparation yielded simiU 
results for p-actln gene quantity. The highest C 
difference between any of rhe samples was 0,S 
and 0.7] for the 100 and 25 nj; samples, respei 
lively. Additionally, the am|)liflt:ati on of eacj 
sample exhibited an' equivalent rate of fluorcs 
cent emission intensity change per amount o 
DNA target analyzed as indicated by .simila 
slopes derived from the sample dilutions (Fig. 2) 
Any sample containing an excess of a PCX inhibi- 
tor would exhibit a greater measured p-actln (;., 
vatuc for a given quantity of DNA. In addition, 
mc inhibitor would be diluted along with I In 
sample in the dilution analysis o ; ig, 2), altering 
the expected C r value change. Each .sample am- 
plification yielded a similar result in the analysis, 
demonstrating that this method of sample prepa- 
ration is highly reproducible with regard lo 
sample purity. 

Quantitative Analvsis of a Plasmid After 

yncR ao/ r ?> r wj «c:fcT 7nn?/cn/7T 
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Toblo 1. Reproducibility of Sample Preparation Method 



Sample 



2 
3 
4 
5 



7 
8 
9 
10 

Mean 



100 ng 



18.24 

18.23 

10.33 

1833 

18.35 

1M4 

18.3 

18.3 

18.42 

18.15 

18.23 

18.32 

18.4 

18.38 

18.46 

18.54 

18.67 

19 

18.28 

18.36 

18.52 

18.45 

18.7 

18.73 

18.18 

18.34 

18.36 

18.42. 

18.57 

i B.r>6 

0 10) 



13.27 0.O6 
0.06 

18.34 0.07 

18.23 0.0S 

1B.42 0.04 

18.74 0.24 

18.39 0.12 

18.63 0.16 

1B.29 0.1 

18.55 0.12 

18,<12 0.17 



25 ng 



standard 
m&an deviation CV C T 



standard 
mean deviation 



20.46 
20*55 

0,32 20.5 20.51 0.03 
20.61 
20.59 

03? ?0.41 0.11 
20.54 
20.6 

0.36 20.49 20.54 0.06 
20.48 
20.44 

0.46 20.38 20.43 0.05 
20.68 
20.87 

0.23 20.63 20.73 0.13 
21.09 
21.04 

1.26 21.04 21.06 0.03 
20.67 
20,73 

0.66 20.65 20.68 0.04 
20,98 
20.84 

0.83 20.75 20.86 0.12 
20,46 
20,54 

0.55 20.48 20.51 0.07 
20.79 
20.78 

0.65 20.62 20.73 0.1 



0,90 



20.66 0.19 



CV 

0.17 

0,54 

0,28 

0.26 

0.61 

0.15 

0.2 

0.57 

0.32 

0.-16 
0.94 



tor containing a partial cDNA for human factor 
vi] I, pl-gTM. A scries of trn infections wa.i scit 
up using a decreasing amoun\ of ihc plasinid s (40, 
A, 0.5 , and 0.1 ug). Tvynniy-rour hours post- 
ion, (ota) T5NA was purified from each 
flask ufctrlfe. p-Aclin £cnc quantity wa*chu>en <i* 
a value Tor not'mali/ialiun of ^c.uuMiic DNA con- 
ccnrruUuii fivm each sample, hi Ihls cApeuiiient, 
(i-actin gene content should' remain constant 
relative to coral genomic UNA. Figure shows ihe 
result of the p-actln DNA measurement (100 jig 
total DNA determined by ultraviolet spectros- 
copy) Ot each ^un;ple. Kach s&mple was analyzed 
jii triplicate and the mean p-acun values of 
the triplicates were plotted (error bars represent 

****** c*-»-wlorrf r!fcr*/ial »nn t *l h#» htPlWKT <iiffrrrnrp 



betwguii any iwf> samplci moan it was 0.9 S C,„ Ten 
nanograms of total DNA of each sample were also 
examined for p-actln. The results again showed 
that Very similar amounts Of genomic DNA wore: 
present; the. maximum mean fi actio O, value 
difference waa 1.0. A3 Figure 3 shows, the rate of 
P-actln C r uliiingv heiwc-cn the 100 and 10-iig 
sample* was similar (slope values rangu butwoon 
3.56 and -3.45). This verifies again thill ih« 
method of sample preparation yields scum pi as of 
identical PCR integrity G'.<-* no sample contained 
an excessive ainuuiil of a PCR Inhibitor). How- 
ever, these results indicate that each sample con 
talncd slight differences in the actual amount of 
genomic ONA analyzed. Determination of actual 
ueijoutic ON A loncerri ration was accomplished 
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Figure 2 Sample preparation purity. 1 he replicottt 
camples shown In Table 1 woro ateo amplified In 
tripicate using 2S tig of each DNA sample. The fig* 
oi* shows the input DNA concentration (100 dnd 
25 ng) vs. C,. In 1h#* f ?0i ir#>, ihp 1O0 and 75 ng 
points for «ach sample are connected by a line. 



by plotting the mean (2-iictio C u value obtained 
for uatih 100- ng SMiUpJv un a ffc-actin standard 
i.-uive (shown in Wg. The actual genomic 

DNA concent r:itic»n of each sinnpK-., was ob 
laLncxt by extrapolation to thu jtfaxU, 

l 7 ij;\ire 'I A shows the measured' (t.u., n««» 
normalised) quantities* of factor VI)) pi asm id 
DNA (pI*8TM) from each' of the four transient cell 
lni">fcc*Urms. Each reaction contained 100 n£ of 
total sample DNA (aa detenu hied by UV spectrum 
copy). Vuxch sample w;is analyzed in triplicate 
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Figure 3 Analysb of lidiisfectcd ccJl DNA quantity 
and purity. I he DNA preparations of the four 293 
cell transections (40, 4, 0.5, and 0.1 p.g of pF8TM) 
were analysed for the fl-actln gene. 1 00 and 1 0 ng 
(determined by ultraviolet spectroscopy) of each 
sample were amplified in triplicate. For each 
amount of pF8TM that was transfected, the (i-aciln 
C 7 values are plotted versus the total Input DNA 



TC:U amplifications. As shown, pl : BTM purifier. 

jfioic Jhc 293 cells decreases (mean G, values in 
cuttwtft) with decreasing amounts of plasmlc 

ttrumUxtcd. The mean C 4 values obtained fo 
prUTM in Ttgufc 4A vvore plotted on u standun. 
curve oc«mprl.Hed uf solidly diluted pKHTM 
shown .in figure 4R. Thu quuiillly uJ pKXTM, /j 
found in each of the four trnnsfectlonR w;\s do 
tcrmhied by extrapolation to thu * axle of tin 
standard curve In 1'i^ure 4tt. These uncorreetet 
values, b, lor pKHTM were n or u tailed 10 deter 
mine the actual amount of pJ'8TM Rrviud pur 10f 
riK of genomic DNA by using the equation:. 



M 8 ^ actual pIB'l*M copies per 
f/ r 1<X) ut genomic UNA 



where a •- actual' genomic DNA in u .sample anc 
U pl : H'l*M copies from the standard curve. T)»< 
normal ir-cd quantity of pl'BTM per 100 jig of 
nomic ONA for each of the four 1ra n,Vf ecilon.s 1: 
shown in Figure 4JJ. 'Hicst: rw»uU* a how tliai th< 
qunniMy of factor VUl plasiuUU asswlaled win 
the cells, 21 lir after iruiusfvciicm, dut.iu.ise.* 
with OccrccUiiiK ttiii 1 HO umtAtiUiatiotj u.sed \\ 
the transection. The quantity of pl'B'J'M ««oeJ 
ateu with 293 cells, after trunsfectlon with 40 m 
of piiifiinid f was 35 pg p«r 100 ng genomic UNA 
Tills results in -S20 plasmid copies per cell. 



DISCUSSION 

We have described a new method for qu an that. 

^enc copy numbers using /caMlnftc anulysl! 
of 1>CR axnpflficatlnns. Real-time PCK i* comical- 
ible with cJthttr of the two PCK (KT-PCR) ap- 
proaches: (1) quantitative compelitiw where an 
Intcinal eompcUlor for each target sequence is 
used for nomiaHxatJon (data not shown) or (2) 
quantiiaiivc comparattvt; PCR using u liuiuicili^cv 
tioix gene contained within the sample (J.e., f3-nc- 
tin) ox a "housekeeiJing" gene for HT-PCK. Ff 
equal amounts of nucleic nod are anaiy/.cd fcjr 
each sample and if the amplification eff kidney 
before quantitative analysis identical for each 
sample, the vnrcniul cojjIkjI ( norma liroi ion gene 
or competircar) should give equal »i>;n&I& for all 
samples. 

The real-time PCU method offers scveml ad. 
vaniages over the other two methods currently 
employed (sec the Introduction). Kirst, the real- 
time P.CR method is performed in a dosed-tube 
system and requires no pnst-PCR manipulation 
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D "™it Qi,f>nt ? tiVC ° n0ly,:i ' f 0f P F8 ™ in Iranrfccttd cclb. M) Amount of 
plasmid DMA used for I he trnnsfecilon plotted against U.e n, Vi >„ c, vafue deter- 
To , ™™'" J "* M hr alter transection. <0,Q Standard curvn* of 

dinLw^irT. u , WpCCl1 ^- Pf8 ™ DNA <*> 6nd Scnomic DNA (Q were 
diluted ^rfally 1 ;S befor* amplification with the appropriate primer*. Trie B-actin 

™ Curv * Wi * USQd lo nom^f,Vc Ihc results of A to 1 00 no of genomic DMA. 
(O) Tho amount of pPSTM present p«:r 100 ng of genomic DNA 



of sample. Therefore, jlu- potential for PCR con* 
lamination in the laboratory is reduced because 
amplified product* cam b« analysed and disponed 
of without opening the reaction tubes. Second, 
this method suppoiU i)i\£ umi of a iiortn;iIiy.- ( itk>n 
gone (U., (3-aciin) /or quant itative. PCR or house- 
keeping genes for quantitative RT-l'CK controls. 
Analysis Js performed in real time during ihe Jog 
phase of product accumulation. Analysis during 
k>K phase permits many different genes (over a 
wide input target range) to be anaJy>.cd simulta- 
neously, without concern of reaching reaction 
plateau at different cycle*, Tim will make uiultl- 
jgent analysis assays much caMe.i to develop, be- 
cause individual internal competitor will not be 
needed for each gene under anoJyals. Third, 
sample throughput will inn ease dramatically 
with the new method because, there is no post- 
PCR processing time. Additionally, winking In a 
DC-well format to highly compatible with auto, 
uicttion technology, 

The real-time PCR method is highly reprn. 
dueible. Replicate, amplifications can be analyzed 



for each sample iiunhnlsdng potential error. The. 
sysU!tn allows- 1'ot a very large assay dynamic: 
range (uppToae.hing 1,000,000- fold Mailing Uu- 
gel). UaiiiK u .standard eurve for the. target oi in- 
terest, relative copy number values can be deter- 
mined for any unknown sample. Fluorescent 
threshold values, C, r con da ir. linearly with rela- 
tive 1>NA copy numbers. Real time quanlllaiive 
KT-PCK methodology (Gitalin et al„ this Issue*) 
has also been developed, finally, real time quan- 
titative PCK methodology can be used it; develop 
high-throughput screening assays for a variety of 
applications [quantitative gene capieaaiuii (KT- 
PCR), Rene copy nibwya <Mcr2, I11V, etc.), £cm> 
typing (knockout mouse anaiysia), and hnniunu- 

ponj. 

Rcal-thne POll may also he j^crformrd using 
intercalating dyes (Htguchi d al. ISWl) such us 
ciJWdiurn bromide. The fluorogenic probe, 
method offers a ma|or advantage over inter- 
calating dyes- greater specificity (i.e., primer 
d I mere and nonspecific PCR products are. not de.- 
tftrted). 
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METHODS 

Generation of a Plasmid Containing a Partial 
cDNA Tor Human Factor YI11 

Tuttd RNA run harvcMcd (UN Awl H Oxmt Tot Tcsir Inc., 
rfK-"ClSYVOOd, T>t) /rum cu)t> it-oafccled with a factor VI 11 
expression v^clor, pClSZ.Uc?.£l> (Katun el dl. WHO; Gor. 
man ci at. 1900V A factor VII! partial cl »N A wpientv WftS 
rtcn*ir«ic«i by HT |<:<m©Amp K'/ tTlh Ittf A t>r.U Kb 
(pan NoOK-cnvs, Applied biosy&tcms, Vmwt CAiy, <-IA)J 

u>liif, »be l ,(:u |»»*iun:f* PHfor *"»t! l-'flrcv (|»riTtwr M'queiire* 

are shown below). Tin' ampllcon was reamplified aMnR 
modified t'ofor and Wrcv primers (uppcmled with Mamlll 
and WmdUl rcslrlclJon site sequence* »t iliv y e»wlj and 
cloned Into po'KM- 32 (Promt^n corp., Muuuon, Wi). The 
resulting clone, pPSTM, was uwil lor transient iransfcalon 
of %M cells. 



Amplification of Target DNA ami Dciecilpn of 
Amplicon Factor VIII Mastoid DNA 

(pt'BTM) was tnni»Mf\wl Willi lhv t«imvii VHUa Z'-CAX:- 

cn'(K;<^\ACiAu:ixjAtxiicrrcv-3' and p»rev 5'-AAAc;crr- 

lUCCXrroCiA'lXajTAC'iCi-.V.IIiti rvnclUui piodutrd if 42a- 
up i*e:K product. The fnrwurd primer wua de^ljjnud to n.v 
ngntxe u uiiImui? M'tpii'iui- found In the 5' untranslated 
region of thu put em pC152.tkZ5J> |i|(tMiti\i ntnl lhi:rcfo/e 
tlovs nut urn i^i jUu <md amplify the human factor VIM 
gems Priinnr* woro chosen with the av*i*t«Mir(* of I he oom- 
pulcr program Oli^o HI (Nuiiimul liiuffciencct, lm\, Ply. 
mouth, MN). The human p-acthi gptir wws amplified with 
the prliiico fl-fu iiii forward primer S'-TCAOCOAClACrCT 
GCCCATCTI ACXiA-.T and fJ-actiu reverse piimcr V.CIAC;. 

CGGAACCX;c:ix:atixx:c:aatgG-3\ The reaction pro- 
Oucco a ZVS-hp rC.U product. 

Amplification' reactions (SO ^J) contained a DNA 
sample. HI* I'C :n Huffur II {ti >xl), 200 jiw dAlT, dOVr, 
dGTP, and 400 »im rill'IT, 4 inM MgCI ? , 1.25 Units Ampil 
7>it; r;NA polymerase, 0,5 unit Amprjnsc uracil N-Riy* 
t:t»,tyluw <UNC), 60 pmolv of ench faciei VIII jirlinvi, and 15 
pi noli* of out'.) i p *ctln pibnor. Tho icoi-ttuiv^ u\m> contained 
Otic Of the following defection probe* (WM1 nw rprh): 

]^j>rt.»bc A'd/AwjAcicrrcrrcicuccrrcjcrn'crri-rcTCT- 

GCCTT(TAMRA)p 3' nud 0-at-4in probe 5 f (PAM)ATGC^« r > 
XG'AMKAJCCCCC/TGr.r.ATCp-.T where p indicates 
phoflphnryUiiAn nrvd X IndlcotcA a linker arm nucleotide. 
Reaction tubes wrrv MicmAmp Optkul TviIks (part Hum* 
1.ht Nttni 00.t.\ Pcrldn Ulnier) tliat wore frotlcO ( W | )Vr|;lu 
nimcr) lo prev«-»»l light from /cflcciln^i Tube capi were 
simitar in MirmAtVip t^nps but .ipeciaJIy desiRiied to pre- 
vein IIrIiI seal lennjj,. All ol th<* \K'M omiMutnhlvU>« wcru »xi>** 
I/bud Ivy PK Applied lUosy»t«ni9 (l ! <»»bT f!Ky, CX) except 
ihr factor VI 11 prlmera, vwlili'H wm* nynlbesiyxd at Cenen 
lech, Inc. (Soul 1 1 f'tm Pr&n clsco, CA). Probev dt-sl^at-d 
u^lng the Oliyo ^.<) software, A>1Lmv||^; gtildellitcs aij;- 

jiC5ic<i in me Model 7700 .Sequence Detector lii.tiiuinml 
manual, briefly, prube T m ^ImhiM Iw a! least 5 U C hlnl^cr 
mail the hhik-uIIhk Itrmpvuiltm: U5cd durlnj; ihrrmul cy- 
rhtig; primcrii slto\ilil not /unit sUhic duplexed wit li the 
probe. 

The ihcriiifti cycling cunditkuvs Included 2 juIji at 
50 V C and 10 niin at 95*C. Tlirxmal cycling prnrrrdVd with 
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reactions were performed in th<» Morlrl 770t) Sequence IV- 
U^icir (PE Applied UlustyiAviuw), urhlrh cuiuaUu i» Ccor - 
Amp »*<":U Sy &t vm P«XX lU:at:lloa condilioi^ w«*rr- ptn- 
^ruili)UL*d on »i I'vtwvr Macintuflh V10CJ (Apple f.V.ini] nil nr 
Ronta Claru, CCA) linked dirvcily w the Model V/0o 
c)uvskv IXilocior. AiwtyvU of data w>« oIko perforrrwd or 
thv W»»flnio*h eompvitcr, f'.oltnrtloii and analytk tofiwar* 
wi\% developed Ml PK Applied BIcKytluihs. 

Transection of Cells with Factor Vlll Convtruci 

Pour T17.S nasks of 293 cells (ATCc: CAW. 1573), a humat 
fetol kidney suspension cell line, wvrc grow i io 80% con 
llucney and trantfected pl-VI'Ki. Cells were (jr«wn in tin 
follnwitig nicdln: 50% HAM'X HI 2 without GUT, Stw> Uw 
filuco3« DiUlKHH.Ni's modified Kaxlo medium (I)MIIM) will) 
enn Rlynnu wiUi sodium bicarlxmate, 10% lew I bnvim 
serum, 2 ium L-^luld»mnr, and 1% peniollin-atrrptomy 
tin. The media was cliangcd 30 rnin kAw the Iransfcc 
Lion, pPUTM DKA amount* of 40, 4, 0Jv,nnd 0.1 wun 
itdded iu 1..S ml of a solution containing 0,125 m CmCI 
and 1 X I1KPKS. Tile four niixhrnn were left at rt>om ten' 
pcrw\«rv (tn Mi mln and iheti acUd^l HnipwLu* lc> the cells 
Tlrv nw>)i* wvm- .intubated at 37 ft C and ,s% C:O s for 24 hr 
washed with V1MSL, rtwuspended In PUS. The K'Kiifl 
jn-nd^l cc\U were divided into tditjooU and UNA wnd ts 
tnicted ItTimedlutuly usinR iheQIAa/n]) Klomi Kit (QUgen 
Ojot?m>rlii, <.iA), DNA w»,s cluled Into 200 y>[ vl 20 
TWs-Ua ul pll B.0, 
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Contributed by David Bot stein and Arnold J. Levine, October 21, 1998 

ABSTRACT Wnt family members are critical to many 
developmental processes, and components of the Wnt signal- 
ing pathway have been linked to tumorigenesis in familial and 
sporadic colon carcinomas. Here we report the identification 
of two genes, WISP-1 and WISP-2, that are up-regulated in the 
mouse mammary epithelial cell line C57MG transformed by 
Wnt-1, but not by Wnt-4. Together with a third related gene, 
WISP-3, these proteins define a subfamily of the connective 
tissue growth factor family. Two distinct systems demon- 
strated WISP induction to be associated with the expression of 
Wnt-1. These included (0 C57MG cells infected with a Wnt-1 
retroviral vector or expressing Wnt-1 under the control of a 
tetracylihe repressible promoter, and («) Wnt-1 transgenic 
mice. The WISP-1 gene was localized to human chromosome 
8q24.1-8q24 j. WISP-1 genomic DNA was amplified in colon 
cancer cell lines and in human colon tumors and its RNA 
overexpressed (2- to > 30-fold) in 84% of the tumors examined 
compared with patient-matched normal mucosa. WISP-3 
mapped to chromosome 6q22-6q23 and also was overex- 
pressed (4- to > 40-fold) in 63% of the colon tumors analyzed. 
In contrast, WISP-2 mapped to human chromosome 20ql2- 
20ql3 and its DNA was amplified, but RNA expression was 
reduced (2- to > 30-fold) in 79% of the tumors. These results 
suggest that the WISP genes may be downstream of Wnt-1 
signaling and that aberrant levels of WISP expression in colon 
cancer may play a role in colon tumorigenesis. 



Wnt-1 is a member of an expanding family of cysteine-rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2). Wnt-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-1 is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling 
by binding to the seven-transmembrane spanning Frizzled 
receptors and recruiting the cytoplasmic protein Dishevelled 
(Dsh) to the cell membrane (1, 2, 6). Dsh then inhibits the 
kinase activity of the normally constitutively active glycogen 
synthase kinase-3/3 (GSK-30) resulting in an increase in 
0-catenin levels. Stabilized j3-catenin interacts with the tran- 
scription factor TCF/Lef 1, forming a complex that appears in 
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the nucleus and- binds TCF/Lef 1 target DNA elements to 
activate transcription (7, 8). Other experiments suggest that 
the adenomatous polyposis coli (APC) tumor suppressor gene 
also plays an important role in Wnt signaling by regulating 
0-catenin levels (9). APC is phosphorylated by GSK-3/3, binds 
to /3-catenin, and facilitates its degradation. Mutations in 
either APC or /3-catenin have been associated with colon 
carcinomas and melanomas, suggesting these mutations con- 
tribute to the development of these types of cancer, implicating 
the Wnt pathway in tumorigenesis (1). 

Although much has been learned about the Wnt signaling 
pathway over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
those encoding the nodal-related 3 gene, Xnr3, a member of 
the transforming growth factor (TGF)-/3 superfamily, and the 
homeobox genes, engrailed, goosecoid, twin (Xtwn), and siamois 
(2). A recent report also identifies c-myc as a target gene of the 
. Wnt signaling pathway (10). 

To identify additional downstream genes in the Wnt signal- 
ing pathway that are relevant to the transformed cell pheno- 
type, we used a PCR-based cDNA subtraction strategy, sup- 
pression subtractive hybridization (SSH) (11), using RNA 
isolated from C57MG mouse mammary epithelial cells and 
C57MG cells stably transformed by a Wnt-1 retrovirus. Over- 
expression of Wnt-1 in this cell line is sufficient to induce a 
partially transformed phenotype, characterized by elongated 
and refractile cells that lose contact inhibition and form a 
multilayered array (12, 13). We reasoned that genes differen- 
tially expressed between these two cell lines might contribute 
to the transformed phenotype. 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnt-1 transformed cells, WISP-1 
and WISP-2, and a third related gene, WISP-3. The WISP genes 
are members of the CCN family of growth factors, which 
includes connective tissue growth factor (CTGF), Cyr61, and 
nov, a family not previously linked to Wnt signaling, 

MATERIALS AND METHODS 

SSH. SSH was performed by using the PCR-Select cDNA ■ 
. Subtraction Kit (CLONTECH). Tester double-stranded 

Abbreviations: TGF, transforming growth factor, CTGF, connective 
tissue growth factor; SSH, suppression subtractive hybridization; 
VWC, von Willebrand factor type C module. 
Data deposition: The sequences reported in this paper have been 
deposited in the Genbank. database (accession nos. AF1 00777, 
AF100778, AF100779, AF100780, and AF100781). 
1To whom reprint requests should be addressed, e-mail: diane@gene. 
com. 
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cDNA was synthesized from 2 jig of poly(A) + RNA isolated 
from the C57MG/Wnt-1 cell line and driver cDNA from 2 pig 
of poly(A)"*" RNA from the parent C57MG cells. The sub- 
tracted cDNA library was subcloned into a pGEM-T vector for 
further analysis. 

cDNA Library Screening. Clones encoding full-length 
mouse WISP-1 were isolated by screening a AgtlO mouse 
embryo cDNA library (CLONTECH) with a 70-bp probe from 
the original partial clone 568 sequence corresponding to amino 
acids 128-169. Clones encoding full-length human WISP-1 
were isolated by screening AgtlO lung and fetal kidney cDNA 
libraries with the same probe at low stringency. Clones en- 
coding full-length mouse and human WISP-2 were isolated by 
screening a C57MG/Wnt-1 or human fetal lung cDNA library 
with a probe corresponding to nucleotides 1463-1512. Full- 
length cDNAs encoding WISP-3 were cloned from human 
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA. PCR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 tiM of each dNTP at 
94°C for 1 sec, 62°C for 30 sec, 72°C for 1 min, for 22-32 cycles. 
WISP and gJyceraldehyde-3-phosphate dehydrogenase primer 
sequences are available on request. 

In Situ Hybridization. 33 P-labeled sense and antisense ribo- 
probes were transcribed from an 897-bp PCR product corre- 
sponding to nucleotides 601-1440 of mouse WISP-1 or a 
294-bp PCR product corresponding to nucleotides 82-375 of 
mouse WlSP-2. All tissues were processed- as described (40). 

Radiation Hybrid Mapping, Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Huntsville, AL) and human and 
hamster control DNAs were PCR-amplified, and the results 
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers. 

Cell Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology (Uni- 
versity of Pittsburgh) for patients undergoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human cell lines: SW480, COLO 320DM, HT-29, 
WiDr, and SW403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T). DNA concentration was determined by using Hoechst 
dye 33258 intercalation f luorimetry. Total RNA was prepared 
by homogenization in 7 M GuSCN followed by centrifugation 
over CsCl cushions or prepared by using RNAzol. 

Gene Amplification and RNA Expression Analysis. Relative 
gene amplification and RNA expression of WISPs and c-myc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PCR. Gene-specific primers and 
fluorogenic probes (sequences available on request) were 
designed and used to amplify and quantitate the genes. The 
relative gene copy number was derived by using the formula 
2< Act) where ACt represents the difference in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphocyte DNA compared with colon tumor DNA or colon 
tumor RNA compared with normal mucosal RNA. The 
3-method was used for calculation of the SE of the gene copy 
number or RNA expression level. The W/S/'-specific signal was 
normalized to that of the glyceraldehyde-3-phosphate dehy- 
drogenase housekeeping gene. All TaqMan assay reagents 
were obtained from Perkin-Elmer Applied Biosystems. 

RESULTS 

Isolation of WISP-1 and WISP-2 by SSH. To identify Wnt- 
l-inducible genes, we used the technique of SSH using the 
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mouse mammary epithelial cell line C57MG and C57MG cells 
that stably express Wnt-1 (11). Candidate differentially ex- 
pressed cDNAs (1,384 total) were sequenced. Thirty-nine 
percent of the sequences matched known genes or homo- 
logues, 32% matched expressed sequence tags, and 29% had 
no matchi To confirm that the transcript was differentially 
expressed, semiquantitative reverse transcription-PCR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/Wnt-1 cells. 

Two of the cDNAs, WISP-1 and WISP-2 t were differentially 
expressed, being induced in the C57MG/ Wnt-1 cell line, but 
not in the parent C57MG cells or C57MG cells overexpressing 
Wnt-4 (Fig. 1 A and B). Wnt-4, unlike Wnt-1, does not induce 
the morphological transformation of C57MG cells and has no 
effect on /3-catenin levels (13, 14). Expression of WISP-1 was 
up-regulated approximately 3-fold in the C57MG/Wnt-1 cell 
line and WISP-2 by approximately 5 -fold by both Northern 
analysis and reverse transcription-PCR. 

An independent, but similar, system was used to examine 
WISP expression after Wnt-1 induction. C57MG cells express- 
ing the Wnt-1 gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed state but show a strong induction of Wnt-1 mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP RNA isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PCR. Strong induction of Wnt-1 mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-fold) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WISPs may be an indirect response to Wnt-1 
signaling. 

cDNA clones of human WISP-1 were isolated and the 
sequence compared with mouse WISP-1. The cDNA sequences 
of mouse and human WISP-1 were 1,766 and 2,830 bp in length, 
respectively, and encode proteins of 367 aa, with predicted 
relative molecular masses of ^40,000 (M T 40 K). Both have 
hydrophobic N-terminal signal sequences, 38 conserved cys- 
teine residues, and four potential N-linked glycosyiation sites 
and are 84% identical (Fig. 2A). 

Full-length cDNA clones of mouse and human WISP-2 were 
1,734 and 1,293 bp in length, respectively, and encode proteins 
of 251 and 250 aa, respectively, with predicted relative molec- 
ular masses of 27,000 (M T 27 K) (Fig. IB). Mouse and human 
WISP-2 are 73% identical. Human WISP-2 has no potential 
N-linked glycosyiation sites, and mouse WISP-2 has one at 

C57MG 



Parem Wni-1 Wnt-* 




Fig. 1. WISP-1 and WISP-2 are induced by Wnt-1, but not Wnt-4, 
expression in C57MG cells. Northern analysis of WISP-1 (A) and 
WJSP-2 (B) expression in C57MG, C57MG/Wnt-1, and C57MG/ 
Wnt-4 cells. Poly(A) + RNA (2 fig) was subjected to Northern blot 
analysis and hybridized with a 70-bp mouse WISP- 1 -specific probe 
(amino acids 278-300) or a 190-bp W75/ > -2-specific probe (nucleotides 
1438-1627) in the 3' untranslated region. Blots were rehybridized with 
human 0-actin probe. 
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Fig. 2. Encoded amino acid sequence alignment of mouse and 
human WISP-l (A) and mouse and human WISP-2 (B). The potential 
signal sequence, insulin-like growth factor-binding protein (IGF-BP), 
VWC, thrombospondin (TSP), and C-terminal (CT) domains are 
underlined. 

position 197. WISP-2 has 28 cysteine residues that are con- 
served among the 38 cysteines found in WISP-l. 

Identification of WISPS. To search for related proteins, we 
screened expressed sequence tag (EST) databases with the 
WISP-l protein sequence and identified several ESTs as 
potentially related sequences. We identified a homologous 
protein that we have called WISP-3. A full-length human 
WISPS cDNA of 1,371 bp was isolated corresponding to those 
ESTs that encode a 354 r aa protein with a predicted molecular 
mass of 39,293. WISP-3 has two potential N-linked glycosyU 
ation sites and 36 cysteine residues. An alignment of the three 
* human WISP proteins shows that WISP-l and WISP-3 are the 
most similar (42% identity), whereas WISP-2 has 37% identity 
with WISP-l and 32% identity with WISP-3 (Fig. 3/1). 

WISPs Are Homologous to the CTGF Family of Proteins. 
Human. WISPS, WISP-2, and WISPS are novel sequences; 
however, mouse WISP-l is the same as the recently identified 
Elml gene. Elml is expressed in low, but not high, metastatic 
mouse melanoma cells, and suppresses the in vivo growth and 
metastatic potential of K-1735 mouse melanoma cells (15). 
Human and mouse WISP-2 are homologous to the recently 
described rat gene, rCop-1 (16). Significant homology (36- 
44%) was seen to the CCN family of growth factors. This family 
includes three members, CTGF, Cyr61, and the protoonco- 
gene nov. CTGF is a chemotactic and mitogen ic factor for 
fibroblasts that is implicated in wound healing and fibrotic 
disorders and is induced by TGF-/3 (17). Cyr61 is an extracel- 
lular matrix signaling molecule that promotes cell adhesion, 
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma overex pressed) is an immediate 
early gene associated with quiescence and found altered in 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, .similarity to Wnt-1. All are 
secreted, cysteine-rich heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

WISP proteins exhibit the modular architecture of the CCN 
family, characterized by four conserved cysteine-rich domains 
(Fig. 35) (21). The N-terminal domain, which includes the first 
12 cysteine residues, contains a consensus sequence (GCGC- 
CXXC) conserved in most insulin-like growth factor" (IGF)- 
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Fig. 3. (A) Encoded amino acid sequence alignment of human 
WISPs. The cysteine residues of WISP-l and WISP-2 that are not 
present in WISP-3 are indicated with a dot. (B) Schematic represen- 
tation of the WISP proteins showing the domain structure and cysteine 
residues (vertical lines). The four cysteine residues in the VWC domain 
that are absent in WISP-3 are indicated with a dot. (C) Expression of 
WISP mRNA in human tissues. PCR was performed on human 
multiple-tissue cDNA panels (CLONTECH) from the indicated adult 
and fetal tissues. 

binding proteins (BP). This sequence is conserved in WISP-2 
and WISP-3, whereas WISP-l has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF-BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagens and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oiigomerization (24). The VWC domain of WISP-3 differs 
from all CCN family members described previously, in that it 
contains only six of the 10 cysteine residues (Fig. 3 A and B). 
A short variable region follows the VWC domain. The third 
module, the thrombospondin (TSP) domain is involved in 
binding to sulfated glycoconju gates and contains six cysteine 
residues and a conserved WSxCSxxCG motif first identified in 
thrombospondin (25). The C-terminal (CT) module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimerization and receptor binding (26). The CT domain is 
present in all CCN family members described to date but is 
absent in WISP-2 (Fig. 3 A and B). The existence of a putative 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs are secreted proteins, an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown). 

Expression of WISP mRNA in Human Tissues. Tissue- 
specific expression of human WISPs was characterized by PCR 
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analysis on adult and fetal multiple tissue cDNA panels. 
WISP-1. expression was seen in the adult heart, kidney, lung, 
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C). 
Little or no expression was detected in the brain, liver, skeletal 
muscle, colon, peripheral blood leukocytes, prostate, testis, or 
thymus. WIS 2*2 had a more restricted tissue expression and 
was detected in adult skeletal muscle, colon, ovary, and fetal 
lung. Predominant expression of WISPS was seen in adult 
kidney and testis and fetal kidney. Lower levels of W1SP-3 
expression were detected in placenta, ovary, prostate, and 
small intestine. 

In Situ Localization of WISP-1 and WISPS, Expression of 
WISP-1 arid WISP-2 was assessed by in situ hybridization in 
mammary tumors from Wnt-1 transgenic mice. Strong expres- 
sion of WISP-1 was observed in stromal fibroblasts lying within 
the fibrovascular tumor stroma (Fig. 4 A-D). However, low- 
1 evel WISP-1 expression also was observed focally within tumor 
cells (data not shown). No expression was observed in normal 
breast. Like WISP-1 \ WISP-2 expression also was seen in the 
tumor stroma in breast tumors from Wnt-1 transgenic animals 
(Fig. 4 £-//)• However, WISP-2 expression in the stroma was 
in spindle-shaped cells adjacent to capillary vessels, whereas 




Fig. 4. (A, C, £, and G) Representative hematoxylin/ eosin-stained 
images from breast tumors in Wnt-1 transgenic mice. The correspond- 
ing dark-field images showing WISP-1 expression are shown in B and 
D. The tumor is a moderately well-differentiated adenocarcinoma 
showing evidence of adenoid cystic change. At low power {A and B) t 
expression of WISP-1 is seen in the delicate branching fibrovascular 
tumor stroma (arrowhead). At higher magnification, expression is seen 
in the stromal(s) fibroblasts (C and D), and tumor cells are negative. 
Focal expression of WISP-1, however, was observed in tumor cells in 
some areas. Images of WISP-2 expression are shown in E-H. At low 
power (£ and /^.expression of WISP-2 is seen in cells lying within the 
fibrovascular tumor stroma. At higher magnification, these cells 
appeared to be adjacent to capillary vessels whereas tumor cells are 
negative (G and H). 



the predominant cell type expressing WISP-1 was the stromal 
fibroblasts. 

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human. WISP genes was determined 
by radiation hybrid mapping panels. WISP-1 is approximately 
3.48 cR from the meiotic marker AFM259xc5 [logarithm of 
odds (lod) score 16.31] on chromosome 8q24.1 to 8q24.3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28). Preliminary fine 
mapping indicates that WISP-1 is located near D8S1712 STS. 
WISP-2 is linked to the marker SHGC-33922 (lod = 1,000) on 
chromosome 20ql2-20ql3.1. Human WISPS mapped to chro- 
mosome 6q22-6q23 and is linked to the marker AFM211ze5 
(lod = 1,000). WISP-3 is approximately 18 Mbs proximal to 
CTGF and 23 Mbs proximal to the human cellular oncogene 
MYB (27, 29). 

Amplification and Aberrant Expression of WISPs in Human 
Colon Tumors. Amplification of protooncogenes is seen in 
many human tumors and has etiological and prognostic sig^ 
nificance. For example, in a variety of tumor types, c-myc 
amplification has been associated with malignant progression 
and poor prognosis (30). Because WISP-1 resides in the same 
general chromosomal location (8q24) as c-myc, we asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus. 
Genomic DNA from human colon cancer cell lines was 
. assessed by quantitative PCR and Southern blot analysis, (Fig. 
5 A and B). Both methods detected similar degrees of WISP-1 
amplification. Most cell lines showed significant (2- to 4-fold) 
amplification, with the HT-29 and WiDr cell lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that the c-myc gene is not part of the amplicon that 
involves the WISP-1 locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PCR (Fig. 6). The copy number of WISP-1 and 
WISP-2 was significantly greater than one, approximately 
2-fold for WISP-1 in about 60% of the tumors and 2- to 4-fold 
for WISP-2 in 92% of the tumors (P < 0.001 for each). The 
copy number for WISPS was indistinguishable from one (P - 
0.166). In addition, the copy number of WISPS was signifi- 
cantly higher than that of WISP-1 {P < 0.001). 

The levels of WISP transcripts in RNA isolated from 19 
adenocarcinomas and their matched normal mucosa were 
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Fig. 5. Amplification of WISP-1 genomic DNA in colon cancer ceil 
lines. (A) Amplification in cell line DNA was determined by quanti- 
tative PCR. (B) Southern blots containing genomic DNA (10 u.g) 
digested with EcoRI (WISP-1) or Xba\ (c-myc) were hybridized with 
a 100-bp human WISP-1 probe (amino acids 186-219) or a human 
c-myc probe (located at bp 1901-2000). The WISP and myc genes are 
detected in normal human genomic DNA after a longer film exposure. 
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Tumor Number 

Fig. 6. Genomic amplification of WISP genes in human colon 
tumors. The relative gene copy number of the WISP genes in 25 
adenocarcinomas was assayed by quantitative PCR, by comparing 
DNA from primary human tumors with pooled DNA from 10 healthy 
donors. The data, are means ± SEM .from one experiment done in 
triplicate. The experiment was repeated at least three times. 

assessed by quantitative PCR (Fig. 7). The level of WISP-1 
RNA present in tumor tissue varied but was significantly 
increased (2- to >25-fold) in 84% (16/19) of the human colon 
tumors examined compared with normal adjacent mucosa. 
Four of 19 tumors showed greater than 10-fold overexpression. 
In contrast, in 79% (15/19) of the tumors examined, WISP-2 
RNA expression was significantly lower in the tumor than the 
mucosa. Similar to WISP-1 , WISP-3 RNAwasoverexpressed in 
63% (12/19) of the colon tumors compared with the normal 
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Fig. 7. RNA expression in primary human colon tumors 

relative to expression in normal mucosa from the same patient 
Expression of WISP mRNA in 19 adenocarcinomas was assayed by 
quantitative PCR. ITie Dukes stage of the tumor is listed under the 
sample number. The data are means ± SEM from one experiment 
done in triplicate. The experiment was repeated at least twice. 



mucosa. The amount of overexpression of WISPS ranged from 
4- to >40-fold. 

DISCUSSION 

One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
cells and normal cells. Strategies based on assumptions that 
steady-state mRNA levels will differ between normal and 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR-based selection 
strategy, SSH, to identify genes selectively expressed in 
C57MG mouse mammary epithelial cells transformed by 
Wnt-1. 

Three of the genes isolated, WISP-1, WISP-2, and WISP-3, 
are members of the CCN family of growth factors, which 
includes CTGF, Cyr61, and nov, a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wnt-L 
The first was C57MG cells infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of 
a tetracyline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not. No WISP RNA expres- 
sion was detected in mammary tumors induced by polyoma 
virus middle T antigen (data not shown). These data suggest 
a link between Wnkl and WISPs in that in these two situations, 
WISP induction was correlated with Wnt-1 expression. 

It is not clear whether the WISPs are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., /3-catenin-TCF-l/Lefl). The increased levels of 
WISP RNA were measured in Wnt-1- transformed cells, hours 
or days after Wnt-1 transformation. Thus, WISP expression 
could result from Wnt-1 signaling directly through /3-catenin 
transcription factor regulation or alternatively through Wnt-1 
signaling turning on a transcription factor, which in turn 
regulates WISPs. 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WISP-2 is the absence of a CT domain, 
which is present in CTGF, Cyr61, nov, WISP-1, and WISP-3. 
This domain is thought to be involved in receptor binding and 
dimerization. Growth factors, such as TGF-/3, platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dimers (32). It is tempting to speculate that 
WISP-1 and WISP-3 may exist as dimers, whereas WISP-2 
exists as a monomer. If the CT domain is also important for 
receptor binding, WISP-2 may bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
or nov. A recent report has shown that integrin a v ft serves as 
an adhesion receptor for Cyr61 (33). 

The strong expression of WISP-1 and WISP-2 in cells lying 
within the fibrovascular tumor stroma in breast rumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to that controlling connective 
- tissue formation during wound repair. It has been proposed 
that mammary tumor cells or inflammatory cells at the tumor 
interstitial interface secrete TGF-/31, which is the stimulus for 
stromal proliferation (34). TGF-/31 is secreted by a large 
percentage of malignant breast tumors and may be one of the 
growth factors that stimulates the production of CTGF and 
WISPs in the stroma. 

It was of interest that WISP-1 and WISP-2 expression was 
observed in the stromal cells that surrounded the tumor cells 
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(epithelial cells) in the Wnt-1 transgenic mouse sections of 
breast tissue. This finding suggests that paracrine signaling 
could occur in which the stromal cells could supply WISP-1 and 
WISP-2 to regulate tumor cell growth on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35). The localization of WISP-1 
and WISP-2 in the stromal cells of breast tumors supports this 
paracrine model. 

An analysis of WISP-1 gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression; whereas overexpression of 
WISP-3 RN A was seen in the absence of DNA amplification. 
In contrast, WISP-2 DNA was amplified in the colon tumors, 
but its mRNA expression was significantly reduced in the 
majority of tumors compared with the expression. in normal 
colonic mucosa from the same patient. The gene for human 
WISP-'2 was localized to chromosome 20ql2-20ql3, at a region 
frequently amplified and associated with poor prognosis in 
node negative breast cancer and many colon cancers, suggest- 
ing the existence of one or more oncogenes at this locus 
(36-38). Because the center of the .20ql3 amplicon has not yet 
been identified, it is possible that the apparent amplification 
observed for WISP-2 may be caused by another gene in this 
amplicon. 

A recent manuscript on rCop-1, the rat orthologue of 
WISP-2, describes the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of 
growth in cell lines (16). Although the mechanism by which 
WISP-2 RNA expression is down-regulated during malignant 
transformation is unknown, the reduced expression of WISP-2 
in colon tumors and cell lines suggests that it may function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis coli and j3-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic /3-catenin, which presumably contributes to hu- 
man carcinogenesis through the activation of target genes such 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumori genesis is unknown, the 
identification of WISPs as genes that may be regulated down- 
stream of Wnt-1 in C57MG cells suggests they could be 
important mediators of Wnt-1 transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development . 
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methods. Peptides AENK or AEQK were dissolved in water, made isotonic with 
NaCl and diluted into RPM1 growth medium. T-cell-proiiferation assays were 
done essentially ai described 20 ' 11 . Briefly, after antigen pulsing (30u.gmr' 
TTCF) with tetrapeptides (l-2mgml"'), PBMCs or EBV-B cells were 
washed in PBS and fixed for 45 s in 0.05% glu tar aldehyde. Glycine was added 
to a final concentration of 0.1 M and the cells were washed five times in RPMI 
1640 medium containing 1% FCS before co-culture with T-cell clones in 
round-bottom 96-well microtitre plates. After 48 h, the cultures were pulsed 
with I u.Ci of 3 H- thymidine and harvested for scintillation counting 16 h later. 
Predigestion of native TTCF was done by incubating 200 ftg TTCF with 0.25 u.g 
pig kidney legumain in 500 u.1 50 mM citrate buffer, pH 5.5, for 1 h at 37 °C. 
Glycopeptide digestions. The peptides H1DNEEDI, HIDN(AT-glucosamine) 
EEDI and HIDI^fESDI, which are based on the TTCF sequence, and 
QQQHLFGSNVTDCSGNFCLFR(KKK), which is based on human transferrin, 
were obtained by custom synthesis. The three C- terminal lysine residues were 
added to the natural sequence to aid solubility. The transferrin glycopeptide 
QQQHLFGSNVTDCSGNFCLFR was prepared by tryptic (Promega) digestion 
of 5 mg reduced, carboxy- methylated human transferrin followed by 
co nca naval in A chromatography 1 1 . Clycopeptides corresponding to residues 
622-642 and 421 -452 were isolated by reverse-phase HPLC and identified by 
mass spectrometry and N-terminal sequencing. The lyophilized transferrin - 
derived peptides were redissolved in 50 mM sodium acetate, pH 5.5, 10 mM 
dithiothreitol, 209b methanol. Digestions were performed for 3 h at 30 °C with 
5-50 mUml"' pig kidney legumain or B-cell AEP. Products were analysed by 
HPLC or MALDI-TOF mass spectrometry using a matrix of 10 mg ml" 1 a- 
cyanocinnamic acid in 50% acetonitrile/0.1% TFA and a PerSeptive Biosy stems 
Elite STR mass spectrometer set to linear or reflector rnode. Internal standar/T 
dization was obtained with a matrix ion of 568.13 mass units. 
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1. Chen, J. M. et at. Cloning, isolation, and characterisation of mammalian legumain, an asparaginyl 
endopeptidase. /. Biol Chem, 272, 8090- 8098 (1997). 

2. Kcmbhavi, A. A« Buttle, D. J., Knight, C C. fit Barrett, A. ). The two cysteine endopeptidases of legume 
seeds: purification and characterization by use of specific fluorometric assays. Arch. Biochem. Biophys. 
303. 208-213 (1993). 

3. Dalton, ). P., Hoi J Jamriska, L & Bridley, P. J. Asparaginyl endopepridase activity in adult Schistosoma 
mansoni. Parasitology 111, 575-580 (1995). 

4. Bcnneu, K. et at. Antigen processing for presenution by class II major histocompatibility complex 
requires cleavage by uthespin E. Eur. /. Immunol 22, 1519-1524 (1992). 

- 5. Rtesc, It ). tt at. Essential role for cathepsin S in MHC diss ll-assoeiated invariant chain processing 
and peptide loading. Immunity A, 357-366 (1996). 

6. Rodriguez. G- M. fit Dimcnt, 5. Role of cathepsin D in antigen presentation ofovalbumin. /. Immunol 
149, 2894- 2B98 (1992). 

7. Hewitt, E. W. et ai Natural processing sites for human cathepsin E and cathepsin D in tetanus toxin: 
implications for T cell epitope generation. /. Immunol 159. 4693-4699 ( 1997). 

8. Watts, C Capture and processing of -exogenous antigens for prcsenutton on MHC molecules. Annu. 
Rev. Immunol. 15, 821-850 (1997). 

9. Chapman, H. A, Endosomal proteases and MHC class II function. Curr. Opin. Immunol 10, 93- 102 
(1998). 

10. Fineschi. B. & Miller, I. Endosomal proteases and antigen processing. Trends Biochem. Scl 22, 377-382 

(1997) . 

11. Lu. J. fit van Halbeek. H. Complete 1 H and "C resonance assignments o f a 2 1 •amino acid g] ycopeptide 
prepared from human serum transferrin. Carbohydr. Res. 296. 1-21 (1996). 

12. Fearon, 0. T. fit tocksley, R. M. The instructive role of innate immunity in the acquired immune 
response. Science 272, 50-54 ( 1996). 

13. Medzhitov, R. & Janeway, C A. ). Innate immunity: the virtues of a nondonal system of recognition. 
Cell 91, 295-298 (1997). 

14. Wyatt. R. et at. The antigenic structure of the HIV gp 120 envdope glycoprotein. Nature 393, 705-71 1 

(1998) . 

15. Botarelli. P. et al N-glycosytation of HIV gpl20 may constrain recognition by T lymphocytes. /. 
Immunol 147,3128-3132 (1991), 

16. Davidson, H. W.. West. M. A. fit Watts, C Endocytosis. intracellular trafficking, and processing of 
membrane IgG and monovalent antigen/membrane IgG complexes in B lymphocytes. /. Immunol 
144. 4101-4109(1990). 

17. Barren. A. J. fit Ktrschkc H. Cathepsin B. cathepsin H and cathepsin L Methods Enzymol. 80, 535-559 ' 
(1981). 

18. Makoff, A. I, Bait an tine, 5. P., Small wood, A. E. fir Fairweather, N. F. Expression of tetanus toxin 
fragment C in E. coih iu purification and potential use as a vaccine. Biotechnology 7, 1043- 1046 
(1989). 

19. Lane. D. P. & Harlow, E. Antibodies: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 
1988). 

20. Laniavecchia. A. Antigen-specific interaction between Tand B cells. Nature 314, S37-S39 (1985). 

21. Pond. L fit Wans, C Characterization of transport of newly assembled, T cell -stimulatory MHC dass 
1 1 -peptide complexes from MHC dass II compartments to the cell surface. /. Immunol 159, S43-553 
(1997). 

Acknowledgements. We thank M. Ferguson for hdpful discussions and advice; E.Smythe and L Grayson 
for advice and technical assistance; B. Spruce, A. Knight and the BTS (Ninewelb Hospital) for hdp with* 
blood monocyte preparation; and our colleagues for many helpful comments on the manuscript. This 
work was supported by the Wellcome Trust and by an EM BO Long-term fellowship to B. M. 

Correspondence and requesu for mauriab should be addressed to CW. (e-maib cwatts@dundee.acuk). 



Genomic amplification of a 
decoy receptor for Fas ligand 
in lung and colon cancer 

Robert M. Pitti*t, Scot A. Marsters*t, David A. Lawrence*!, 
Margaret Roy*, Frank C. Kischkel\ Patrick Dowd*, 
Arthur Huang*, Christopher J. Donahue*, 
Steven W. Sherwood*, Daryl T. Baldwin*, Paul J. Godowski*, 
William I. Wood*, Austin L. Gurney*, Kenneth J. Hillan*, 
Robert L. Cohen*, Audrey D. Goddard*, David Botstein* 
& Avi Ashkenazi* 

* Departments of Molecular Oncology, Molecular Biology, and Immunology, 
Genentech Inc., 1 DNA Way, South San Francisco, California 94080, USA 
t Department of Genetics, Stanford University, Stanford, California 94305, USA 
t These authors contributed equally to this work 



Fas ligand (FasL) is produced by activated T cells and natural 
killer cells and it induces apoptosis (programmed cell death) in 
target cells through the death receptor Fas/Apol/CD95 (ref. 1). 
One important role of FasL and Fas is to mediate immune- 
cytotoxic killing of cells that are potentially harmful to the 
organism, such as virus-infected or tumour cells'. Here we 
report the discovery of a soluble decoy receptor, termed decoy 
receptor 3 (DcR3), that binds to FasL and inhibits FasL-induced 
apoptosis. The DcR3 gene was amplified in about half of 35 
primary lung and colon tumouFi studied, and DcR3 messenger 
RNA was expressed in malignant tissue. Thus, certain tumours 
may escape FasL-dependent immune-cytotoxic attack by expres- 
sing a decoy receptor that blocks FasL 

By searching expressed sequence tag (EST) databases, we identi- 
fied a set of related ESTs that showed homology to the tumour 
necrosis factor (TNF) receptor (TNFR) gene superfamily 2 . Using 
the overlapping sequence, 'we isolated a previously unknown full- 
length complementary DNA from human fetal lung. We named the 
protein encoded by this cDNA decoy receptor 3 (DcR3). The cDNA 
encodes a 300-amino-acid polypeptide that resembles members of 
the TNFR family (Fig. la): the amino terminus contains a leader 
sequence, which is followed by four tandem cysteine-rich domains 
(CRDs). Like one other TNFR homologue, osteoprotegerin (OPG) 3 , 
DcR3 lacks an apparent transmembrane sequence, which indicates 
that it may be a secreted, rather than a membrane-asscociated, 
molecule. We expressed a recombinant, histidine-tagged form of 
DcR3 in mammalian cells; DcR3 was secreted into the cell culture 
medium, and migrated on polyacrylamide gels as a protein of 
relative molecular mass 35,000 (data not shown). DcR3 shares 
sequence identity in particular with OPG (31%) and TNFR2 
(29%), and has relatively less homology with Fas (17%). All of 
the cysteines in the four CRDs of DcR3 and OPG are conserved; 
however, the carboxy-terminal portion of DcR3 is 101 residues 
shorter. 

We analysed expression of DcR3 mRNA in human tissues by 
northern blotting (Fig. lb). We detected a predominant 1.2-kilobase 
transcript in fetal lung, brain, and liver, and in adult spleen, colon 
and lung, in addition, we observed relatively high DcR3 mRNA 
expression in the human colon carcinoma cell line SW480. 

To investigate potential ligand interactions of DcR3, we generated 
a recombinant, Fc-tagged DcR3 protein. We tested binding of 
DcR3-Fc to human 293 cells transfected. with individual TNF- 
family ligands, which are expressed as type 2 transmembrane 
proteins (these transmembrane proteins have their N termini in 
the cytosol). DcR3-Fc showed a significant increase in binding to 
cells transfected with FasL 4 (Fig. 2a), but not to cells transfected with 
TNF 5 , Apo2L/TRAIL 6,7 , ApoSL/TWEAK** 9 , or OPGL/TRANCE/ 
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RANKL 10 " 13 (data not shown). DcR3-Fc immuno precipitated shed 
FasL from FasL-transfected 293 cells (Fig. 2b) and purified soluble 
FasL (Fig. 2c), as did the Fc-tagged ectodomain of Fas but not 
TNTR1. Gel-filtration chromatography showed that DcR3-Fc and 
soluble FasL formed a stable complex (Fig. 2d). Equilibrium 
analysis indicated that DcR3-Fc and Fas-Fc bound to soluble 
FasL with a comparable affinity (K d — 0.8 ± 0.2 and 
1.1 ±0.1nM, respectively; Fig. 2e), and that DcR3-Fc could 
block nearly all of the binding of soluble FasL to Fas-Fc (Fig. 2e ( 
inset). Thus, DcRJ competes with Fas for binding to FasL. 

To determine whether binding of DcR3 inhibits FasL activity, we 
tested the effect of DcR3-Fc on apoptosis induction by soluble 
FasL in Jurkat T leukaemia cells, which express Fas (Fig. 3a). DcR3- 
Fc and Fas-Fc blocked soluble-FasL-induced apoptosis in a 
similar dose-dependent manner, with half-maximal inhibition at 
—0.1 u-gmr 1 . Time-course analysis showed that the inhibition did 
not merely delay cell death, but rather persisted for at least 24 hours 
(Fig. 3b). We also tested the effect of DcR3-Fc on activation- 
induced cell death (A1CD) of* mature T lymphocytes, a FasL- 
dependent process 1 . Consistent with previous results 13 , activation 
of interleukin-2-stimulated CD4-positive T cells with anti-CD3 
antibody increased the level of apoptosis twofold, and Fas-Fc 
blocked this effect substantially (Fig. 3c); DcR3-Fc blocked the 



induction of apoptosis to a similar extent. Thus, DcR3 binding 
blocks apoptosis induction by FasL. 

FasL-induced apoptosis is important in elimination of virus- 
infected cells and cancer cells by natural killer cells and cytotoxic T 
lymphocytes; an alternative mechanism involves perforin and 
granzymes 1,14 "'*. Peripheral blood natural killer cells triggered 
marked cell death in Jurkat T leukaemia cells (Fig. 3d); DcR3-Fc 
and Fas-Fc each reduced killing of target cells from -65% to 
— 30%, with half-maximal inhibition at —1 M-gml" 1 ; the residual 
killing was probably mediated by the perforin/granzyme pathway. 
Thus, DcR3 binding blocks FasL-dependent natural killer cell 
activity. Higher DcR3-Fc and Fas-Fc concentrations were required 
to block natural killer cell activity compared with those required to 
block soluble FasL activity, which is consistent with the greater 
potency of membrane-associated FasL compared with soluble 
FasL 17 . 

Given the role of immune-cytotoxic cells in elimination of 
tumour cells and the fact that DcR3 can act as an inhibitor of 
FasL, we proposed that DcR3 expression might contribute to the 
ability of some tumours to escape immune-cytotoxic attack. As 
genomic amplification frequently contributes to tumorigenesis, we 
investigated whether the DcR3 gene is amplified in cancer. We 
analysed DcR3 gene -copy number by quantitative polymerase chain 
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Figure 1 Primary structure and expression of human DcR3. a, Alignment of the 
amtno-acid sequences of DcR3 and of osteoprotegerin (OPG); the C-terminal 101 
residues of OPG are not shown. The putative signal cleavage site (arrow), the 
cysteine-rich domains (CRD 1 -4), and the AMinked glycosylation site (asterisk) are 
shown, b. Expression of DcR3 mRNA. Northern hybridization analysis was done 
using the DcR3 cDNA as a probe and blots of poly(A)* RNA (Clontech) from 
human fetal and adult tissues or cancer cell lines. PBL peripheral blood 
lymphocyte. 



Figure 2 Interaction of DcR3 with FasL a, 293 cells were transfected with pRK5 
vector (top) or with pRK5 encoding full-length FasL (bottom), incubated with 
DcR3-Fc (solid line, shaded area), TNFRi-Fc (dotted line) or buffer control 
(dashed line) (the dashed and dotted lines overlap), and analysed for binding by 
FACS. Statistical analysis showed a significant difference (P < 0.001 ) between the 
binding of DcR3-Fc to cells transfected with FasL or pRK5. PE, phycoerythrin- 
labelled cells, b, 293 cells were transfected as in a and metabolically labelled, and 
celt supernatants were immunoprecipttated with Fc-tagged TNFR1, DcR3 or Fas. 
c, Purified soluble FasL (sFasL) was immunoprecipitated with TNFR1-Fc, DcR3- 
Fc or Fas-Fc and visualized by immunoblot with anti-FasL antibody. sFasL was 
loaded directly for comparison in the right-hand lane, d, Rag-tagged sFasL was 
incubated with DcR3-Fc or with buffer and resolved by gel filtration; column 
fractions were analysed in an assay that detects complexes containing DcR3-Fc 
and sFasL-Flag. e, Equilibrium binding of DcR3-Fc or Fas-Fc to sFasL-Fiag. 
Inset, competition of DcR3-Fc with Fas-Fc for binding to sFasL-Flag. 
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reaction (PCH) 18 in genomic DNA from 35 primary lung and colon 
tumours, relative to pooled genomic DNA from peripheral blood 
leukocytes (PBLs) of 10 healthy donors. Eight of 18 lung tumours 
and 9 of 17 colon tumours showed DcR3 gene amplification, 
ranging from 2- to 18-fold (Fig. 4a, b). To confirm this result, we 
analysed the colon tumour DNAs with three more, independent sets 
of DcR3 -based PCR primers and probes; we observed nearly the 
same amplification (data, not shown). 

We then analysed DcR3 mRNA expression in primary tumour 
tissue sections by in situ hybridization. We detected DcR3 expres- 
sion in 6 out of 15 lung tumours, 2 out of 2 colon tumours, 2 out of 5 
breast tumours, and 1 out of 1 gastric tumour (data not shown). A 
section through a squamous-ceil carcinoma of the lung is shown in 
Fig. 4c. DcR3 mRNA was localized to infiltrating malignant epithe- 
lium, but was essentially absent from adjacent stroma, indicating 
tumour-specific expression. Although the individual tumour speci- 
mens that we analysed for mRNA expression and gene amplification 
were different, the in situ hybridization results are consistent with 
the finding that the DcR3 gene is amplified frequently in tumours. 
SW480 colon carcinoma cells, which showed abundant DcR3 
mRNA expression (Fig. lb), also had marked DcR3 gene amplifica- 
tion, as shown by quantitative PCR (fourfold) and by Southern blot 
hybridization (fivefold) (data not shown). 

If DcR3 amplification in cancer is functionally relevant, then 
DcR3 should be amplified more than neighbouring genomic 
regions that are not important for tumour survival. To test this, 



C 
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we mapped the human DcR3 gene by radiation-hybrid analysis; 
DcR3 showed linkage to marker AFM2 18xe7 (T160), which maps to 
chromosome position 20ql3. Next, we isolated from a bacterial 
artificial chromosome (BAC) library a human genomic clone that 
carries DcR3> and sequenced the ends of the clone's insert. We then 
determined, from the nine colon tumours that showed twofold or 
greater amplification of DcR3, the copy number of the DcR3- 
flanking sequences (reverse and forward) from the BAC, and of 
seven genomic markers that span chromosome 20 (Fig. 4d). The 
DcR3 -linked reverse marker showed an average amplification of 
roughly threefold, slightly less than the approximately fourfold 
amplification of DcR3; the other markers showed little or no 
amplification. These data indicate that DcR3 may be at the 'epi- 
centre' of a distal chromosome 20 region that is amplified in colon 
cancer, consistent with the possibility that DcR3 amplification 
promotes tumour survival. 

Our results show that DcR3 binds specifically to FasL and inhibits 
FasL activity. We did not detect DcR3 binding to several other TNF- 
ligand- family members; however, this does not rule out the possi- 
bility that DcR3 interacts with other ligands, as do some other 
TNFR family members, including OPG 2 *' 9 . . 

FasL is important in regulating the immune response; however, 
little is known about how FasL function is controlled. One mechan- 
ism involves the molecule cFLIP, which modulates apoptosis signal- 
ling downstream of Fas 20 . A second mechanism involves proteolytic 
shedding of FasL from the cell surface 17 . DcR3 competes with Fas for 
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Figure 3 Inhibition of FasL activity by DcR3. a, Human Jurkat T leukaemia cells 
were incubated with Rag-tagged soluble FasL (sFasL. 5 ng ml"') oligomerized 
with anti-Flag antibody (0.1 ligmf 1 ) in the presence of the proposed inhibitors 
DcR3-Fc, Fas-Fc or human IgGl arid assayed for apoptosis (mean ± s.e.m. of 
triplicates), b Jurkat cells were incubated with sFasL-Flag.plus anti-Flag antibody 
as in a, in presence of 1 M-g rnl~' DcR3-Fc (filled, circles), Fas-Fc (open circles) or 
human IgGl (triangles), and apoptosis was determined at the indicated time 
points, c, Peripheral blood T cells were stimulated with PHA and interleukin-2, 
followed by control (white bars) or anti*CD3 antibody (rilled bars), together with 
phosphate-buffered saline (PBS), human IgGl. Fas-Fc. or"DcR3-Fc (lu>gmr l ). 
After 16 h, apoptosis of CD4* cells was determined (mean ± s.e.m. of results from 
five donors), d, Peripheral blood natural killer cells were incubated with 5, Cr- 
labelled Jurkat cells in the presence of DcR3-Fc (rilled circles), Fas-Fc (open 
circles) or human IgGl (triangles), and target-cell death was determined by 
release of 51 Cr (mean ± s.d. for two donors, each in triplicate).- 
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Figure 4 Genomic amplification of DcR3 in tumours, a, Lung cancers, comprising 
eight adenocarcinomas (c, d, f, g. h, j, k, r), seven squamous-cell carcinomas (a, e. 
m, n, o. p. q), one non-small-cell carcinoma (b). one small-cell carcinoma (i), and 
one bronchial adenocarcinoma (I). The data are means ± s.d. of 2 experiments 
done in duplicate, b, Colon tumours, comprising 17 adenocarcinomas. Data are 
means ± s.e.m. of five experiments done in duplicate.- c, In situ hybridization 
analysis of DcR3 mRNA expression in a squamous-cell carcinoma of the lung. A 
representative bright-field image (left) and the corresponding dark-field image 
(right) show DcR3 mRNA over infiltrating malignant epithelium (arrowheads). 
Adjacent non-malignant stroma (S). blood vessel (V) and necrotic tumour tissue 
(N) are also shown, d. Average amplification of DcR3 compared with amplifica- 
tion of neighbouring genomic regions (reverse and forward, Rev and Fwd), the 
OcR3-linked marker T160. and other chromosome-20 markers, in the nine colon 
tumours showing DcR3 amplification of twofold or more (b). Data are from two 
experiments done in duplicate. Asterisk indicates P < 0.01 for a Student's Hest 
comparing each marker with DcR3. 
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FasL binding; hence, it may represent a third mechanism of 
extracellular regulation of FasL activity. A decoy receptor that 
modulates the function of the cytokine interleukin- 1 has been 
described 21 . In addition, two decoy receptors that belong to the 
TNFR family, DcRl and DcR2, regulate the FasL-related apoptosis- 
inducing molecule Apo2L 22 . Unlike DcRl and DcR2, which are 
membrane-associated proteins, DcR3 is directly secreted into the 
extracellular space. One other secreted TNFR-family member is 
OPG\ "which shares greater sequence homology with DcR3 (31%) 
than do DcRl ( 17%) or DcR2 (19%); OPG functions as a third 
decoy for Apo2L". Thus, DcR3 and OPG define a new subset of 
TNFR-family members that function as secreted decoys to mod- 
ulate ligands that induce apoptosis. Pox viruses produce soluble 
TNFR homologues that neutralize specific TNF-family ligands, 
thereby modulating the antiviral immune response 2 . Our results 
indicate that a similar mechanism, namely, production of a soluble 
decoy receptor for FasL, may contribute to immune evasion by 
certain tumours. Q 

Methods 

Isolation of DcR3 cONA. Several overlapping ESTs in GenBank (accession 
numbers AA025672, AA025673 and W67560) and in Lifeseq™ (Incyte 
Pharmaceuticals; accession numbers 1339238, 1533571, 1533650, 1542861, 
1789372 and 2207027) showed similarity to members of the TNFR family. We 
screened human cDNA libraries by PCR with primers based on the region of 
EST consensus; fetal lung was positive for a product of the expected size. By 
hybridization to a PCR-gen crated probe based on the ESTs, one positive clone 
(DNA30942) was identified. When searching for potential alternatively spliced 
forms of DcR3 that might encode a transmembrane protein, we isolated 50 
more clones; the coding regions of these clones were identical in size to that of 
the initial clone (data not shown). 

Fc-fusion proteins (immunoadhesins). The entire DcR3 sequence, or the 
ectodomain of Fas orTNFRl, was fused to the hinge and Fc region of human 
lgGl, expressed in insect SF9 cells or in human 293 cells, and purified as 
described". 

Fluorescence-activated cell sorting (FACS) analysis. We transfected 293 
cells using calcium phosphate or Effectene (Qiagen) with pRK5 vector or pRK5 
encoding full-length human FasL 4 (2 u.g). together with pRK5 encoding CrmA 
(2 jig) to prevent cell death. After 16 h, the cells were incubated with 
biotinylated DcR3-?cor TNFRl-Fc and then with phycoerythrin -conjugated 
streptavidin (GibcoBRL), and were assayed by FACS. The data were analysed by 
Kolmogorov-Smirnov statistical analysis. There was some detectable staining 
of vector-transfected cells by DcR3-Fc; as these cells express little FasL (data 
not shown), it is possible that DcRJ recognized some other factor that is 
expressed constitutively on 293 cells, 

Immunoprecipitation. Human 293 cells were transfected as above, and 
metabolically labelled with [ 35 S] cysteine and ( 3S S] methionine (0.5 mCi; 
Amersham). After 16h of culture in the presence of z-VAD-frnk (IOjlM), 
the medium was immunoprecipitated with DcR3-Fc, Fas-Fc or TNFRl-Fc 
(5^g), followed by protein A-Sepharose (Repligen). The precipitates were 
resolved by SDS-PAGE and visualized on a phosphorimager (Fuji BAS2000). 
Alternatively, purified, Flag-tagged soluble FasL (1 (Alexis) was incubated 
with each Fc-fusion protein ( 1 p.g). precipitated with protein A-Sepharose ( 
resolved by SDS-PAGE and visualized by immunoblotting with rabbit anti- 
FasL antibody (Oncogene Research). 

Analysis of complex formation. Flag-tagged soluble FasL (25p.g) was 
incubated with buffer or with DcR3-Fc (40 u-g) for 1.5 h at 24 °C The reaction 
was loaded onto a Superdex 200 HR 10/30 column (Pharmacia) and developed 
with PBS; 0.6- ml fractions were collected. The presence of DcR3-Fc-FasL 
complex in each fraction was analysed by placing 1 00 |xl aliquots into microti tre 
wells precoated with anti-human IgG (Boehringer) to capture DcR3-Fc, 
followed by detection with biotinylated anti-Flag antibody Bio M2 (Kodak) and 
streptavidin -horseradish peroxidase (Amersham). Calibration of the column 
indicated an apparent relative molecular mass of the complex of 420K (data not 
shown), which is consistent with a stoichiometry of two DcR3-Fc homodimers 
to two soluble FasL homotrimers. 

Equilibrium binding analysis. Microtitre wells were coated with anti-human 



IgG, blocked with 2% BSA in PBS. DcR3-Fc or Fas-Fc was added, followed by 
serially diluted Flag-tagged soluble FasL. Bound ligand was detected with anti- 
Flag antibody as above. In the competition assay, Fas-Fc was immobilized as 
above; and the wells were blocked with excess lgGl before addition of Flag- 
tagged soluble FasL plus DcR3-Fc. 

T-cell AICD. CD3* lymphocytes were isolated from peripheral blood of 
individual donors using anti-CD3 magnetic beads (Miltenyi Biotech), 
stimulated with phytohaemagglutinin (PHA; 2 p-g ml" 1 ) for 24 h, and cultured 
in the presence of interleukin- 2 (100 U ml" 1 ) for 5 days. The cells were plated in 
wells coated with anti-CD3 antibody (Pharmingen) and analysed for apoptosis 
16 h later.by FACS analysis of annexin-V-binding of CD4 + cells* 4 . 
Natural killer cell activity. Natural killer cells were isolated from peripheral 
blood of individual donors using an ti- CD 56 magnetic beads (Miltenyi 
Biotech), and incubated for 16h with 5l Cr-loaded Jurkat cells at an effector- 
to-target ratio of 1:1 in the presence of DcR3-Fc, Fas-Fc or human lgGl. 
Target -cell death was determined by release of **Cr in effector- target co- 
cultures relative to release of 5, Cr by detergent lysis of equal numbers of Jurkat 
cells. ; 

Gene-amplification analysis. Surgical specimens were provided by J. Kern 
(lung tumours) and P. Quirke (colon tumours). Genomic DNA was extracted 
(Qiagen) and the concentration was determined using Hoechst dye 33258 
intercalation fluorometry. Amplification was determined by quantitative PCR" 
using a TaqMan instrument ( ABI). The method was validated by comparison of 
PCR and Southern hybridization data for the Myc and HER- 2 oncogenes (data 
not shown). Gene -specific primers and fluorogenic probes were designed on 
the basis of the sequence of DcR3 or of nearby regions identified on a BAC 
carrying the human DcR3 gene; alternatively, primers and probes were based 
on Stanford Human Genome Center marker AFM218xe7 (T160), which is 
linked to DcR3 (likelihood score = 5.4), SHGC-36268 (T159), the nearest 
available marker which maps to —500 kilobases from T160, and five extra 
markers that span chromosome 20. The DcR3 -specific primer sequences were 
5'-CTTCTTCGCGCACGCTG-3' and 5'-ATCACGCCGGCACCAG-3' and the 
fluorogenic probe sequence was 5 ' - ( FAM - ACACG ATGCGTGCTCCAAGCAG 
AAp-(TAMARA), where FAM is 5 '--fluorescein phosphoramidite. Relative 
gene-copy numbers were derived using the formula 2 (iCT1 , where ACT is the 
difference in amplification cycles required to detect DcR3 in peripheral blood 
lymphocyte DNA compared to test DNA. 

Received 24 September; accepted 6 November 1998. 
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ABC transporters (also known as traffic ATPases) form a large 
family of proteins responsible for the translocation of a variety 
of compounds across- membranes of both prokaryotes and 
eukaryotes 1 . The recently completed Escherichia coli genome 
sequence revealed that the largest family of paralogous E. -coli 
proteins is composed of ABC transporters 2 . Many eukaryotic 
proteins of medical significance belong to this family, such as 
the cystic fibrosis transmembrane conductance regulator (CFTR), 
the P-glycoprotein (or multidrug-resistance protein) and the 
heterodimeric transporter associated - with antigen processing 
(Tapl-Tap2). Here we report the crystal structure at 1.5 A resolu- 
tion of HisP, the ATP-binding subunit of the histidine permease, 
which is an ABC transporter from Salmonella typhimurium. We 
correlate the details of this structure with the biochemical, genetic 
and biophysical properties of the wild-type and several mutant 
HisP proteins. The structure provides a basis for understanding 
properties of ABC transporters and of defective CFTR proteins. 

ABC transporters contain four structural domains: two nucleo- 
tide-binding domains (NBDs), which are highly conserved 
throughout the family, and two transmembrane domains 1 . In 
prokaryotes these domains are often separate subunits which are 
assembled into a membrane -bound complex; in eukaryotes the 
domains are generally fused into a single polypeptide chain. The 
periplasmic histidine permease of S. typhimurium and £ co/i u_B is a 
well-characterized ABC transporter that is a good model for this 
superfamily. It consists of a membrane-bound complex, HisQMP 2 , 
which comprises integral membrane subunits, HisQ and HisM, and 
two copies of HisP, the ATP-binding subunit. HisP, which has 
•properties intermediate between those of integral and peripheral 
membrane proteins 9 , is accessible from both sides of the membrane, 
presumably by its interaction with HisQ and HisM 6 . The two HisP 
subunits form a dimer, as shown by their cooperativity in ATP 
hydrolysis 5 , the requirement for both subunits to be present for 
activity 4 , and the formation of a HisP dimer upon chemical cross- 
linking. Soluble HisP also forms a dimer 3 . HisP has been purified 
and characterized in an active, soluble form 3 which can be recon- 
stituted into a fully active membrane-bound complex*. 

The overall shape of the crystal structure of the HisP monomer is 
that of an T with two thick arms (arm I and arm II); the ATP- 
binding pocket is near the end of arm I (Fig, 1). A six-stranded (3- 
sheet (p3 and |38-(312) spans both arms ofthe L, with a domain of a 
a- plus (3-type structure ((31, (32, (34~(37, al and a2) on one side 
(within arm I) and a domain of mostly a-helices (a3-a9) on the 




Figure 1 Crystal structure of HisP. a t View of the dimer along an axis 
perpendicular to its two-fold axis. The top and bottom ofthe dimer are suggested 
to face towards the periplasmic and cytoplasmic sides, respectively (see text). 
Trie thickness of arm II is about 25 A. comparable to that of membrane. a-Helices 
are shown in orange and p-sheets in green, b, View along the two-fold axis of the 
HisP dimer, showing the relative displacement of the monomers not apparent in 
a. The fi-strands at the dimer interface are labelled, c, View of one monomer from 
the bottom of arm I, as shown in a. towards arm II, showing the ATP-binding 
pocket, a-c, The protein and the bound ATP are in 'ribbon' and 'ball-and-stick' 
representations, respectively. Key residues discussed in the text are indicated in 
c. These figures were prepared with MOLSCRIPT 29 . N, amino terminus; C, C 
terminus. 
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Gene amplification is a common event in the progression of 
human cancers, and amplified oncogenes have been shown to 
have diagnostic, prognostic and therapeutic relevance. A 
kinetic quantitative polymerase*chain-reaction (PCR) method, 
based on fluorescent TaqMan methodology and a new instru- 
ment (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real-time, was used to quantify 
gene amplification in tumor DNA. Reactions are character- 
ized by the point during cycling when PCR amplification is still 
in the exponential phase, rather than the amount of PCR 
product accumulated after a ftxed number of cycles. None of 
the. reaction components is limited during the exponential 
phase, meaning that values are highly reproducible in reac- 
tions starting with the same copy number. This greatly 
improves the precision of DNA quantification. Moreover, 
real-time PCR does not require post-PCR sample handling, 
thereby preventing potential PCR-product carry-over con- 
tamination; it possesses a wide dynamic range of quantifica- 
tion and results in much faster and higher sample throughput. 
The real-time PCR method, was used to develop and validate 
a simple and rapid assay for the detection and quantification 
of the 3 most frequently amplified genes (myc, ccndl and 
erbB2) in breast tumors. Extra copies of myc, ccndl and erbBZ 
were observed in 1 0, 23 and 1 5%, respectively, of 1 08 breast- 
tumor DNA; the largest observed numbers of gene copies 
were 4.6, 18-6 and 15.1, respectively. These results correlated 
well with those of Southern blotting. The use of this new 
semi-automated technique will make molecular analysis of 
human cancers simpler and more reliable, and should find 
broad applications in clinical and research settings. Int. L 
Cancer 78:661-666, 1998. 
© J 998 Wiley-Lis$ t Inc. 

Gene amplification plays an important role in the pathogenesis 
of various solid tumors, including breast cancer, probably because 
over-expression of the amplified target genes confers a selective 
advantage. The first technique used to detect genomic amplification 
was cytogenetic analysis. Amplification of several chromosome 
regions, visualized either as extrachromosomal double minutes 
(dmins) or as integrated homogeneously staining regions (HSRs), 
are among the main visible cytogenetic abnormalities in breast 
tumors. Other techniques such as comparative genomic hybridiza- 
tion (CGH) (Kallioniemi ei aL, 1994) have also been used in broad 
searches for regions of increased DNA copy numbers in tumor 
cells, and have revealed some 20 amplified chromosome regions in 
breast tumors. Positional cloning efforts are underway to identify 
the critical gene(s) in each amplified region. To date, genes known 
to be amplified frequently in breast cancers include myc (8q24), 
ccnd\ (llql3), anderMJ2 ( 1 7ql 2 -q21) (for review, seeBiecheand 
Lidereau, 1995). 

Amplification of the myc, ccndJ, and erbBl proto-oncogenes 
should have clinical relevance in breast cancer, since independent 
studies have shown that these alterations can be used to identify 
sub-populations with a worse prognosis (Berns ei aL, 1992; 
Schuuring et aL, 1992; 91amon et aL, 1987). Muss et al, (1994) 
suggested that these gene alterations may also be useful for the 
prediction and assessment of the efficacy of adjuvant chemotherapy 
and hormone therapy. 

However, published results diverge both in terms of the fre- 
quency of these alterations and their clinical value. For instance,/ 
over 500 studies in 10 years have failed to resolve the controversy 



surrounding the link suggested by Slamon ei al. (1987) between 
erbBl amplification and disease progression. These discrepancies 
are partly due to the clinical, histological and ethnic heterogeneity 
of breast cancer, but technical considerations are also probably 
involved. 

Specific genes (DNA) were initially quantified in tumor cells by 
means of blotting procedures such as Southern and slot blotting. 
These batch techniques require large amounts of DNA (5-10 
u.g/reaction) to yield reliable quantitative results. Furthermore, 
meticulous care is required at all stages of the procedures to 
generate blots of sufficient quality for reliable dosage analysis. 
Recently, PCR has proven to be a powerful tool for quantitative 
DNA analysis, especially with minimal starting quantities of tumor 
samples (small, early-stage rumors and formalin-fixed, paraffin- 
embedded tissues). 

Quantitative PCR can be performed by evaluating the amount of 
product either after a given number of cycles (end-point quantita- 
tive PCR) or after a varying number of cycles during the 
exponential phase (kinetic quantitative PCR). In the first case, an 
internal standard distinct from the target molecule is required to 
ascertain PCR efficiency. The method is relatively easy but implies 
generating, quantifying and storing an internal standard for each 
gene studied. Nevertheless, it is the most frequently applied 
method to date. 

One of the major advantages of the kinetic method is its rapidity 
in quantifying a new gene, since no internal standard is required (an 
external standard curve is sufficient). Moreover, the kinetic method 
has a wide dynamic range (at least 5 orders of magnitude), giving 
an accurate value for samples differing in their copy number. 
Unfortunately, the method is cumbersome and has therefore been 
rarely used. It involves aliquot sampling of each assay mix at 
regular intervals and quantifying, for each aliquot, the amplifica- 
tion product. Interest in the kinetic method has been stimulated by a 
novel approach using fluorescent TaqMan methodology and a new 
instrument (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real time (Gibson et aL, 1996; Heid et 
aL, 1996). The TaqMan reaction is based on the 5' nuclease assay 
first described by Holland et aL (1991). The latter uses the 5/ 
nuclease activity of Taq polymerase to cleave a specific fluorogenic 
oligonucleotide probe during the extension phase of PGR. The 
approach uses dual -labeled fluorogenic hybridization probes (Lee 
ei aL, 1993). One fluorescent dye, co-valently linked to the 5' end 
of the oligonucleotide, serves as a reporter [FAM (Le., 6-carboxy- 
fluorescein)] and its emission spectrum is quenched by a second 
fluorescent dye, TAMRA '.(i.e., 6-carboxy-tetramethyl-rhodamine) 
attached to the 3' end. During the extension phase of the PCR 
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cycle, the fluorescent hybridization probe is hydro lyzed by the 
5 '-3' nucleolytic activity of DNA polymerase. Nuclease degrada- 
tion of the probe releases the quenching of FAM fluorescence 
emission, resulting in an increase in peak fluorescence emission. 
The fluorescence signal is normalized by dividing the emission 
intensity of the reporter dye (FAM) by the emission intensity of a 
reference dye (i.e., ROX, 6-carboxy-X-rhodamine) included in 
TaqMan buffer, to obtain a ratio defined as the Rn (normalized 
reporter) for a given reaction tube. The use of a sequence detector 
enables the fluorescence spectra of all 96 wells of the thermal 
cycler to be measured continuously during PCR amplification. 

The real-time PCR method offers several advantages over other 
current quantitative PCR methods (Celi et ai, 1994): (i) the 
probe-based homogeneous assay provides a real-time method for 
detecting only specific amplification products, since specific hybri- 
dation of both the primers and the probe is necessary to generate a 
signal; (ii) the C, (threshold cycle) value used for quantification is 
measured when PCR amplification is still in the log phase of PCR 
product accumulation. This is the main reason why C, is a more 
reliable measure of the starting copy number than are end-point 
measurements, in which a slight difference in a limiting component 
can have a drastic effect on the amount of product; (Hi) use of C, 
values gives a wider dynamic range (at least 5 orders of magni- 
tude), reducing the need for serial dilution; (iv) The real-time PCR 
method is run in a closed-tube system and requires no post-PCR 
sample handling, thus avoiding potential contamination; (v) the 
system is highly automated, since the instrument continuously 
measures fluorescence in all 96- wells of the thermal cycler during 
PCR amplification and the corresponding software processes, and 
analyzes the fluorescence data; (vi) the assay is rapid, as results are 
available just one minute after thermal cycling is complete; (vii) the 
sample throughput of the method is high, since 96 reactions can be 
analyzed in 2 hr. 

Here, we applied this semi-automated procedure to determine 
the copy numbers of the 3 most frequently amplified genes in breast 
tumors (myc, ccndl and erbB2\ as well as 2 genes (alb and app) 
located in a chromosome region in which no genetic changes have 
been observed in breast tumors. The results for 108 breast tumors 
were compared with previous Southern-blot data for the same 
samples. 



MATERIAL AND METHODS 
Tumor and blood samples 

Samples were obtained from 1 08 primary breast tumors removed 
surgically from patients at the Centre Rene Huguenin; none of the 
patients had undergone radiotherapy or chemotherapy. Immedi- 
ately after surgery, the rumor samples were placed in liquid 
nitrogen until extraction of high-molecular-weight DNA. Patients 
were included in this study, if the tumor sample used for DNA 
preparation contained more than 60% of tumor cells (histological 
analysis). A blood sample was also taken from 18 of the same 
patients. 

DNA was extracted from tumor tissue and blood leukocytes 
according to standard methods. 

Real-time PCR 

Theoretical basis. Reactions are characterized by the point 
during cycling when amplification of the PCR product is first 
detected, rather than by the amount of PCR product accumulated 
after a fixed number of cycles. The higher the starting copy number 
of the genomic DNA target, the earlier a significant increase in 
fluorescence is observed. The parameter C, (threshold cycle) is 
defined as the fractional cycle number at which the fluorescence 
generated by cleavage of the probe passes a fixed threshold above 
baseline. The target gene copy number in unknown samples is 
quantified by measuring Q and by using a standard curve to 
determine the starting copy number. The precise amount of 
genomic DNA (based on optical density) and its quality (i.e., lack 



of extensive degradation) are both difficult to assess. We therefore 
also quantified a control gene (alb) mapping to chromosome region 
4qll-ql3. in which no genetic alterations have been found in 
breast-tumor DNA by means of CGH (Kallioniemi et ai, 1 994). 

Thus, the ratio of the copy number of the target gene to the copy 
number of the alb gene normalizes the amount and quality of 
genomic DNA. The ratio defining the level of amplification is 
termed "N", and is determined as follows: ' 

copy number of target gene [app, myc, ccndl, erbB2) 

N = — ■ ■■ . 

copy number of reference gene (alb) 

Primers, probes, reference human genomic DNA and PCR 
consumables. Primers and probes were chosen with the assistance 
of the computer programs Oligo 4.0 (National Biosciences, Ply- 
mouth, MN), EuGene (Daniben Systems, Cincinnati, OH) and Primer 
Express (Perkin-Elmer Applied Biosystems, Foster City, CA). 

Primers were purchased from DNAgency (Malvern, PA) and 
probes from Perkin-Elmer Applied Biosystems. 

Nucleotide sequences for the oligonucleotide hybridization 
probes and primers are available on request. 

The TaqMan PCR Core reagent kit, Micro Amp optical tubes, 
and Micro Amp caps were from Perkin-Elmer Applied Biosystems. 

Standard-curve construction. The kinetic method requires a 
standard curve. The latter was constructed with serial dilutions of 
specific PCR products, according to Piatak et ai .(1993). In 
practice, each specific PCR product was obtained by amplifying 20 
rig of a standard human genomic DNA (Boehringer, Mannheim, 
Germany) with the same primer pairs as those used later for 
real-time quantitative PCR. The 5 PCR products were purified 
using MicroSpin S-400 HR columns (Pharmacia, Uppsala, Swe- 
den) electrophorezed through an acrylamide ge! and stained with 
ethidium bromide to check their quality. The PCR products were 
then quantified spectrophotometrically and pooled, and serially 
diluted 1 0-fold in mouse genomic DNA (Clontech, Palo Alto, CA) 
at a constant concentration of 2 ng/pl The standard curve used for 
real-time quantitative PCR was based on serial dilutions of the pool 
of PCR products ranging from 10" 7 (1 0 s copies of each gene) to 
10 -10 (10 2 copies). This series of diluted PCR products was 
aliquoted and stored at - 80°C until use. 

The standard curve was validated by analyzing 2 known 
quantities of calibrator human genomic DNA (20 ng and 50 rig). 

PCR amplification. Amplification mixes (50 ul) contained the 
sample DNA (around 20 ng, around 6600 copies of disomic genes), 
10X TaqMan bufTer (5 nl), 200 uM dATP, dCTP, dGTP, and 400 
uM dUTP, 5 mM MgCl 2 , L25 units of AmpliTaq Gold, 0.5 units of 
AmpErase uracil N-glycosylase (UNG), 200 nM each primer and 
100 nM probe. The thermal cycling conditions comprised 2 min at 
50°C and 10 min at 95°C. Thermal cycling consisted of 40 cycles at 
95°C for 15 s and 65°C for I min. Each assay included: a standard 
curve (from 10 5 to 10 2 copies) in duplicate, a no-template control, 
20 ng and 50 ng of calibrator human genomic DNA (Boehringer) in 
triplicate, and about 20 ng of unknown genomic DNA in triplicate 
(26 samples can thus be analyzed on a 96- well microplate). All 
samples with a coefficient of variation (CV) higher than 10°/o were 
retested. 

All reactions were performed in the ABI Prism 7700 Sequence 
Detection System (Perkin-Elmer Applied Biosystems), which 
detects the signal from the fluorogenic probe during PCR. 

Equipment for real-time detection. The 7700 system has a 
built-in thermal cycler and a laser directed via fiber optical cables 
to each of the 96 sample wells. A charge-coupled-device (CDD) 
camera collects the emission from each sample and the data are 
analyzed automatically. The software accompanying the 7700 
system calculates C, and determines the starting copy number in the 
samples. 
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Determination of gene amplification. Gene amplification was 
calculated as described above. Only samples with an N value 
higher than 2 were considered to be amplified. 

RESULTS 

To validate the method, real-time PCR was performed on 
genomic DNA extracted from 108 primary breast tumors, and. 18 
normal leukocyte DNA samples from some of the same patients. 
The target genes were the myc, ccndl and erb&2 proto-oncogenes, 
and the p-amyloid precursor protein gene (app), which maps to a 
chromosome region (2 lq21 .2) in which no genetic alterations have 
been found in breast rumors (Kallioniemi et al, 1994). The 
reference disomic gene was the albumin gene {alb, chromosome 
4qll-ql3). 



Validation of the standard curve and dynamic range 
of real-time PCR 

The standard curve was constructed from PCR products serially 
diluted in genomic mouse DNA at a constant concentration of 
2 ng/ul. It should be noted that the 5 primer pairs chosen to analyze 
the 5 target genes do not amplify genomic mouse DNA (data not 
shown). Figure 1 shows the real-time PCR standard curve for the 
alb gene. The dynamic range was wide (at least 4 orders of 
magnitude), with samples containing as few as 10 2 copies or as 
many as 10 5 copies. 

Copy-number ratio of the 2 reference genes fapp and alb) 

The app to alb copy-number ratio was determined in 1 8 normal 
leukocyte DNA samples and all 108 primary breast-tumor DNA 
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Figure 1 - Albumin (alb) gene dosage by real-time PCR. Top: Amplification plots for reactions with starting alb gene copy number ranging 
from I0 3 (A9), 10 4 (A7), 10 3 (A4) to 10 5 (A2) and a no-template control (Al). Cycle number is plotted vs. change in normalized reporter signal 
(ARn). For each reaction tube, the fluorescence signal of the reporter dye (FAM) is divided by the fluorescence signal of the passive reference dye 
(ROX), to obtain a ratio defined as the normalized reporter signal (Rn). ARn represents the normalized reporter signal (Rn) minus the baseline 
signal established in the first 15 PCR cycles. ARn increases during PCR as alb PCR product copy number increases until the reaction reaches a 
plateau. Q (threshold cycle) represents the fractional cycle number at which a significant increase in Rn above a baseline signal (horizontal black 
line) can first be detected. Two replicate plots were performed for each standard sample, but the data for only one are shown here Bottom- 
Standard curve plotting log starting copy number vs. C, (threshold cycle). The black dots represent the data for standard samples plotted in 
duplicate and the red dots the data for unknown genomic DNA samples plotted in triplicate. The standard curve shows 4 orders of linear dynamic 
range. 
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samples. We selected these 2 genes because they are located in 2 
chromosome regions {app. 21q21.2; alb, 4ql I -q 13) in which no 
obvious genetic changes (including gains or losses) have been 
observed in breast cancers (Kallioniemi et al, 1994). The ratio for 
the 18 normal leukocyte DNA samples fell between 0.7 and 1.3 
(mean 1.02 ± 0.21), and was similar for the 108 primary breast- 
tumor DNA samples (0.6 to 1.6, mean 1.06 ± 0.25), confirming 
that alb and app are appropriate reference disomic genes for 
breast-tumor DNA. The low range of the ratios also confirmed that 
the nucleotide sequences chosen for the primers and probes were 
not polymorphic, as mismatches of their primers or probes with the 
subject's DNA would have resulted in differential amplification. 

myc, ccndl and erb£2 gene dose in normal leukocyte DNA 

To determine the cut-off point for gene amplification in breast- 
cancer tissue, 18 normal leukocyte DNA samples were tested for 
the gene dose (N), calculated as described in "Material and 
Methods". The N value of these samples ranged from 0.5 to 1.3 
(mean 0.84 ± 0.22) for myc; 0.7 to 1.6 (mean 1.06 ± 0.23) for 
ccndl and 0.6 to 1.3 (mean 0.91 ± 0.19) forer&B2. SinceNvalues 
for myc, ccndl and erbBl in normal leukocyte DNA consistently 
fell between 0.5 and 1.6, values of 2 or more were considered to 
represent gene amplification in tumor DNA. 

myc, ccndl and erbB2 gene dose in breast-tumor DNA 

myc, ccndl and erbBl gene copy numbers in the 108 primary 
breast tumors are reported in Table I. Extra copies of ccndl were 
more frequent (23%, 25/108) than extra copies of erbBl (15%, 
16/108) and myc (10%, 11/108), and ranged from 2 to 18.6 for 
ccndl, 2 to 15.1 for er£B2, and only 2 to 4.6 for the myc gene. 
Figure 2 and Table IT represent tumors in which the ccndl gene was 
amplified 16-fold (T145), 6-fold (T133) and non-amplified (TU8). 
The 3 genes were never found to be co-amplified in the same tumor. 
erb&2 and ccndl were co-amplified in only 3 cases, myc and ccndl 
in 2 cases and myc and er6B2 in 1 case. This favors the hypothesis 
that gene amplifications are independent events in breast cancer. 
Interestingly, 5 tumors showed a decrease of at least 50% in the 
erbBl copy number (N < 0.5), suggesting that they bore deletions 
of the 17q21 region (the site of er6B2). No such decrease in copy 
number was observed with the other 2 proto-oncogenes. 

. Comparison of gene dose determined by real-time quantitative 
PCR and Southern-blot analysis 

Southern-blot analysis of myc, ccndl and erbBl amplifications 
had previously been done on the same 1 08 primary breast tumors. A 
perfect correlation between the results of real-time PCR and 
Southern blot was obtained for tumors with high copy numbers 
(N s 5). However, there were cases (1 myc, 6 ccndl and 4 erbBl) 
in which real-time PCR showed gene amplification whereas 
Southern-blot did not, but these were mainly cases with low extra 
copy numbers (N from 2 to 2.9). 

DISCUSSION 

The clinical applications of gene amplification assays are 
currently limited, but would certainly increase if a simple, standard- 
ized and rapid method were perfected. Gene amplification status 
has been studied mainly by means of Southern blotting, but this 
method is not sensitive enough to detect low-level gene amplifica- 
tion nor accurate enough to quantify the full range of amplification 
values. Southern blotting is also time-consuming, uses radioactive 



TABLE I - DISTRIBUTION OF AMPLIFICATION LEVEL (N) FOR myc, 
ccndl AND erbB2 GENES IN 108 HUMAN BREAST TUMORS 



Gene 




Amplification level (N) 




<0.5 


0.5-1.9 2-4.9 


as 


myc 


0 


97(89.8%) 11 (10.2%) 


0 


ccndl 


0 


83 (76.9%) 17(15.7%) 


8 (7.4%) 


erbB2 


5 (4.6%) 


87 (80.6%) 8 (7.4%) 


8 (7,4%) 



reagents and requires relatively large amounts of high-quality 
genomic DNA, which means it cannot be used routinely in many 
laboratories. An amplification step is therefore required to deter- 
mine the copy number of a given target gene from minimal 
quantities of tumor DNA (small early-stage tumors, cytopuncture 
specimens or formalin-fixed, paraffin-embedded tissues). 

In this study, we validated a PCR method developed for the 
quantification of gene over-representation in rumors. The method, 
based on real-time analysis of PCR amplification, has several 
advantages over other PCR-based quantitative assays such as 
competitive quantitative PCR (Celi et ai, 1 994). First, the real-time 
PCR method is performed in a closed-tube system, avoiding the 
risk of contamination by amplified products. Re-amplification of 
carryover PCR products in subsequent experiments can also be 
prevented by using the enzyme uracil N-glycosylase (UNG) 
(Longo ei ai, 1990). The second advantage is the simplicity and 
rapidity of sample analysis, since no posl-PCR manipulations are 
required. Our results show that the automated method is reliable. 
We found it possible to determine, in triplicate, the number of 
copies of a target gene in more than 1 00 tumors per day. Third, the 
system has a linear dynamic range of at least 4 orders of magnitude, 
meaning that samples do not have to contain equal starting amounts 
of DNA. This technique should therefore be suitable for analyzing 
formalin-fixed, paraffin-embedded tissues. Fourth, and above all, 
real -time PCR makes DNA quantification much more precise and 
reproducible, since it is based on C t values rather than end-point 
measurement of the amount of accumulated PCR product. Indeed, 
the ABI Prism 7700 Sequence Detection System enables C, to be 
calculated when PCR amplification is still in the exponential phase 
and when none of the reaction components is rate-limiting. The 
within-run CV of the C t value for calibrator human DNA (5 
replicates) was always below 5%, and the between-assay precision 
in 5 different runs was always below 10% (data not shown). In 
addition, the use of a standard curve is not absolutely necessary, 
since the copy number can be determined simply by comparing the 
Q rauo of the target gene with that of reference genes. The results 
obtained by the 2 methods (with and without a standard curve) are 
similar in our experiments (data not shown). Moreover, unlike 
competitive quantitative PCR, real-time PCR does not require an 
internal control (the design and storage of internal controls and the 
validation of their amplification efficiency is laborious). 

The only potential disavantage of real-time PCR, like all other 
PCR-based methods and solid-matrix blotting techniques (South- 
ern blots and dot blots) is that is cannot avoid dilution artifacts 
inherent in the extraction of DNA from tumor cells contained in 
heterogeneous tissue specimens. Only FISH and immunohistochem- 
istry can measure alterations on a cell-by-cell basis (Pauletti et ai, 
19915; Slamon et ai, 1989). However, FISH requires expensive 
equipment and trained personnel and is also time-consuming. 
Moreover, FISH does not assess gene expression and therefore 
cannot detect cases in which the gene product is over-expressed in 
the absence of gene amplification, which will be possible in the 
future by real-time quantitative RT-PCR. Immunohistochemistry is 
subject to considerable variations in the hands of different teams, 
owing to alterations of target proteins during the procedure, the 
different primary antibodies and fixation methods used and the 
criteria used to define positive staining. 

The results of this study are in agreement with those reported in 
the literature. (0 Chromosome regions 4qll-ql3 and 21q21.2 
(which bear alb and app, respectively) showed no genetic alter- 
ations in the breast-cancer samples studied here, in keeping with 
the results of CGH (Kallioniemi et al, 1994). (ri) We found that 
amplifications of these 3 oncogenes were independent events," as 
reported by other teams (Berns et al, 1 992; Borg et ai, 1 992). (Hi) 
The frequency and degree of myc amplification in our breast tumor 
DNA series were lower than those of ccndl and erbBl amplifica- 
tion, confirming the findings of Borg et al. ( 1 992) and Courjal et al. 
(1997). (iv) The maxima of ccndl and erbBl over-representation 
were 1 8- fold and 1 5 -fold, also in keeping with earlier results (about 
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ALB 
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I T118 



Ct Copy number &t Copy number 
27.3 4605 26.5 4365 



■ T133 23.2 61659 25.2 10092 

■ T145 22.1 125892 25.6 7762 

Figure 2 - ccndl and alb gene dosage by real-time PCR in 3 breast tumor samples: Tl 1 8 (E 1 2, C6, black squares), T133 (Gil, B4, red squares) 
and T145 (A8, C8, blue squares). Given the C, of each sample, the initial copy number is inferred from the standard curve obtained during the same 
experiment Triplicate plots were performed for each tumor sample, but the data for only one are shown here. The results are shown in Table II. 



30-fold maximum) (Bems et aL, 1992; Borg et aL, 1992; Courjal et 
aL, 1997). (v) The erbBl copy numbers obtained with real-time 
PCR were in good agreement with data obtained with other 
quantitative PCR-based assays in terms of the frequency and 
degree of amplification (An et aL, 1995; Deng et aL, 1996; Valeron 



et aL, 1996). Our results also correlate well with those recently 
published by Gelmini et aL ( 1 997), who used the TaqMan system to 
measure er6B2 amplification in a small series of breast tumors 
(n = 25), but with an instrument (LS-50B luminescence spectrom- 
eter, Perkin-Elmer Applied Biosy stems) which only allows end- 
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TABLE H - EXAMPLES OF ccnd! GENE DOSAGE RESULTS 
FROM 3 BREAST TUMORS' 



Tumor 




ccndl 






alb 




Hccndt/alb 


Copy 
number 


Mean 


SD 


Copy 
number 


Mean 


SD 


T118 


4525 






4223 










4605 


4603 


77 


4365 


4325 


89 


1.06 




4678 






4387 








T133 


59821 






9787 










61659 


61100 


mi 


10092 


10137 


375 


6.03 




61821 






10533 








T145 


128563 






7321 










125892 


125392 


3448 


7762 


7672 


316 


16.34 




121722 






7933 









l For each sample, 3 replicate experiments were performed and the mean 
and the standard deviation (SD) was determined. The level of ccndl gene 
amplification (NccndJ/alb) is determined by dividing the average ccnd J 
copy namber vaJ ue by the average alb copy number value. 



point measurement of fluorescence intensity. Here we report myc 
and ccndl gene dosage in breast cancer by means of quantitative 
PCR. (vi) We found a high degree of concordance between 
real-time quantitative PCR and Southern blot analysis in terms of 
gene amplification, especially for samples with high copy numbers 
(> 5-fold). The slightly higher frequency of gene amplification 
(especial Jy ccnd] and er£B2) observed by means of real-time 
quantitative PCR as compared with Southem-blot analysis may be 
explained by the higher sensitivity of the former method. However, 
we cannot rule out the possibility that some tumors with a few extra 



gene copies observed in real-time PCR had additional copies of an 
arm or a whole chromosome (trisomy, tetrasomy or polysomy) 
ratheT than true gene amplification. . These 2 types of genetic 
alteration (polysomy and gene amplification) could be easily 
distinguished in the future by using an additional probe located on 
the same chromosome arm, but some distance from the target gene. 
It is noteworthy that high gene copy numbers have the greatest 
prognostic significance in breast carcinoma (Borg et al, 1992; 
Slamon er a/., 1987). 

Finally, this technique can be applied to the detection of gene 
deletion as well as gene amplification. Indeed, we found a 
decreased copy number of erb&l (but not of the other 2 proto- 
oncogenes) in several tumors; erbB2 is located in a chromosome 
region (17q21) reported to contain both deletions and amplifica- 
tions in breast cancer (Bieche and Lidereau, 1995). 

In conclusion, gene amplification in various cancers can be used 
as a marker of pre -neoplasia, also for early diagnosis of cancer, 
staging, prognostication and choice of treatment. Southern blotting 
is not sufficiently sensitive, and FISH is lengthy and complex. 
Real-time quantitative PCR overcomes both these limitations, and 
is a sensitive and accurate method of analyzing large numbers of 
samples in a short time. It should find a place, in routine clinical 
gene dosage. 
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been leading Genentech f s Tumor Antigen Project, which is a large research project 
with a primary focus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3. As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells. 
The purpose of this research is to identify proteins that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins' 4 . When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein. 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4. In the course of the research conducted by Genentech' s Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in human tumor cells relative to normal cells, 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used technique of microarray analysis 
which has proven to be extremely usefiil for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another. In the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corresponding normal human cells. To date, we 
have generated antibodies that bind to about 30 of the tumor antigen proteins 
expressed from these differentially expressed gene transcripts and have used these 
antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells. We 
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above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 



expressed from that mRNA in that cell type. In approximately 80% of our 
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correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 
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opinion that for human genes, an increased level of mRNA in a tumor cell relative 
to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 
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(3) Menschliche Nukleinsauresequenzen aus Blase-Tumor 

® Es werdon menschliche Nukleinsauresequenzen - 
mRNA, cDNA, genomrscho Sequenzen - aus Blasentu- 
morgewebo, die fur Gcnprodukle oder Teile da von kodio- 
ren, und deren Vorwcndung beschrieben. 
Es werden weiterhin die uber die Sequenzen erhaltlichen 
Polypeptide und deren Verwendung beschrieben. 



< 

S 

00 
r— 

00 



1U 

a 



BUNDESDRUCKEREI 09.99 902 043/433/1 



29 



DE 198 18 619 A 1 

Beschreibung 

Die Erfindung betrifft menschliche Nuklcinsauresequenzcn aus Blascntumorgcwebc, die fur Genproduktc oder Tcilc 
davon kodieren, deren funktionale Gene, die mindestens cin biologiscb aktives Polypeptid kodieren und deren Vfcrwen- 
5 dung. 

Die Erfindung betrifft weiterhin die liber die Sequenzen erhaltlichen Polypeptide und deren Verwendung. 

Eine der Hauptkrebstodesursachen ist der Blasentumor, fur dessen Bekampfung neue Hierapien notwendig sind. Bis- 
her verwendete Therapien, wie z. B. Chemotherapie, Honnonlherapie oder chirurgische Enlfemung des 1\imorgcwebcs, 
fuhren haufig nicht zu einer vollstandigen Heilung. 
10 Das Phanomcn Krebs geht haufig cinher mit der ttber- oder Untercxpression gewisser Gene in den entarteten Zellcn, 
wobei noch unklar ist, ob diese verandcrten Expressionsraten Ursacbe oder Folge der malignen Transformation sind. Die 
Identification solcher Gene ware ein wesentlicher Schritt flir die Entwicklung neuex Therapien gegen Krebs. Der spon- 
tanen EntstchuDg von Krebs geht baufig eine Vielzahl von Mutationen voraus. Diese konnen vcrschiedenste Auswirkun- 
gen auf das Exprcssionsmuster in dem betroffenen Gewebe baben, wie z. B. Unter- oder Uberexpression, aber auch Ex- 
15 pression verkiirztcr Gene. Mehrcre solcber Veranderungen durcb solche Mutationskaskaden konnen schlieBlicb zu bos- 
arligcn Entartungcn fuhren. Die Komplexitat solcher Zusammenhiinge erschwert die experinientelle Herangehensweise 
sebr. 

Fur die Suche nach Kandidatcngenen, d. h. Genen, die im Vergieich zum 1\jmorgewebe im normalen Gewebe starker 
exprimiert werden, wird eine Dalenbank verwendet, die aus sogenannten ESTfc besteht. ESTb (Expressed Sequence Tags) 

20 sind Sequenzen von cDNAs, d. h. rovers transkribierten mRNAs, den Molekiilen also, die die Expression von Genen wi- 
derspiegeln. Die EST-Sequenzen werden fur normale und entartcte Gewebe ermittelt. Solche Datenbanken werden von 
yerschiedenen Betrcibern z. T. kommerziell angeboten. Die ESlfc der LifeSeq-Datenbank, die bier verwendet wird, sind 
in der Regel zwischen 150 und 350 Nukleotide lang. Sie reprasentieren cin fur ein bestimmtes Gen unverkennbarcs Mu- 
ster, obwohl dieses Gen normalerweisc sehr viel langer ist (> 2000 Nukleotide). Durcb Vergieich der Exprcssionsmuster 

25 von normalen und T\jmorgewebe konnen ESTfc identifiziert werden, die fur die 1\imorentstehung und -proliferation 
wichu'g sind. Es bestebt jedoch folgendes Problem: Da durch untcrschiedliche Konstruktionen der cDNA-Bibhothcken 
die gefundencn EST-Sequenzen zu unterechieduchen Regionen eines unbekannten Gens gchoren konnen, ergabe sicb in 
cinem solchen Fall ein voUig falsches Verhaltnis des Vorkommens dieser ESTs in dem jeweibgen Gewebe. Dieses wiirde 
erst bemcrkt werden, wenn das vollstandigc Gen bekannt ist und somit die ES'fli dem gleichen Gen zugeordnet werden 

30 konnen. 

Es wurde nun gefunden, daB diese Fehlermoghchkeit verringert werden kann, wenn zuvor samdiche ES r lfc aus dem je- 
wciligen Gewebstyp assembliert werden, bevor die Exprcssionsmuster miteinander vcrglichen werden. Es wurden also 
uberlappcndc ESTs ein und dcsselbcn Gens zu langeren Sequenzen zusammengcfaBt (s. Fig. 1, Fig. 2a und Fig. 3). 
Durch diese Verlangerung und damit Abdeckung eines wcsentlich groBeren Genbereichs in jeder der jeweibgen Bankcn 
3S solltc der oben beschriebene Fehier weitgehendst vermieden werden. Da es hierzu keine bestehenden Softwareprodukte 
gab, wurden Programme fur das Asscmblicrcn von genomischen Abscbnitten verwendet, die abgewandelt eingcsctzl und 
durch eigene Programme erganzt wurden. Ein Flowchart der Assemblierungsprozedur isl in Fig. 2bl-2b4 dargcstellt 

Es konnten nun die Nukleinsaure-Sequenzen Seq. ID No. 1 50 gefunden werden, die als Kandidatengene beim Bla- 
sentumor eine Rolle spielen. 

40 Von besonderem Interessc sind die Nukleinsaure-Sequenzen Seq. ID Nos. 2-5, 7-13, 16, 1 8, 20, 23, 26-27, 3 1-32, 36, 
45. 

Die Erfindung betrifft somit Nukleinsaure-Sequenzen, die ein Genprodukt oder ein Tbil davon kodieren, umfassend 

a) eine Nukleinsaure-Sequenz, ausgewahlt aus der Gruppe der Nukleinsaure-Sequenzen Seq. ID Nos. 2-5 7-13 
4S 16, 18, 20, 23, 26-27, 31-32, 36, 45. 

b) cine allelische Variation der unter a) genanntcn Nukleinsaure-Sequenzen 
oder 

c) eine Nukleinsaure-Sequenz, die komplementar zu den unter a) oder b) genannten Nukleinsaure-Sequenzen isl. 

50 Die Erfindung betrifft weiterhin eine Nukleinsaure-Sequenz gemaB einer der Sequenzen Seq, ID Nos 2-5 7-13 16 
1 8, 20, 23, 26-27, 31-32, 36, 45 oder eine komplementare oder allelische Variante davon und die Nukleinsaure-Sequen- 
zen davon, die eine 90%ige bis 95%igc Homologie zu einer humanen Nukleinsaure-Sequenz aufweisen. 

Die Erfindung betrifft auch die Nukleinsaure-Sequenzen Seq. ID No. 1 bis Seq. ID No. 50, die im Blasentumorgewebe 
erhobt exprimiert sind. 

55 Die Erfindung bctrifll ferner Nukleinsaure-Sequenzen, umfassend einen Teil der oben genannten Nukleinsaure-Se- 
quenzen, in solch einer ausreichenden GroBe, daB sie mil den Sequenzen Seq. ID Nos. 1-50 hybridisieren. 

Die erfindungsgemaBcn Nukleinsaure-Sequenzen weisen im allgemeinen eine Lange yon mindestens 50 bis 4500 bp, 
vorzugsweise eine Lange von mindestens 150 bis 4000 bp, insbesonderc eine Lange von 450 bis 3500 bp auf. 

Mit den erfindungsgemaBcn Tcilscqucnzen Seq. ID Nos. 1-50 konnen gemaB gangiger Verfahrenspraxis auch Exprcs- 
60 sionskassettcn konstruiert werden, wobci auf der Kassettc mindestens eine der erfindungsgcmaBen Nukleinsaure-Se- 
quenzen zusammcn mit mindestens einer dem Fachmann allgcmcin bekannten Kontroll- oder regulatorischen Sequenz, 
wie /.. B. einein geeigneten Prornotor, kombinierl wird. Die erfindungsgemaBcn Sequenzen konnen in sense oder anti- 
sense Orienticrung eingefugt sein. 
In der Literatur sind ist cine groBe Anzahl von Expressionskasscttcn bzw. Vektoren und Promotoren bekannt, die vcr- 
65 wendct werden konnen. 

Unter Expressionskassetten bzw. Vektoren sind zu verstchen: 1. baktcricllc, wie z. B., phagescript, pBs, (|>X174, pBIu- 
cscript SK, pBs KS, pNI18a, pNHl 6a, pnH 18a, pNH46a (Stratagenc), pTYc99A, pKK223-3, pKK233-3, pbR540,'pRIT5 
(Pharmacia), 2. cukaryontische, wie z. B. pWLneo, pSV2cat, pOG44, pXTl, pSG (Stratagcne), pSVK3, pBPv/pMSG 
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pSVL (Pharmacia). 

Unler Kontroll- oder regulatorischcr Sequenz sind geeignete Prornotoren zu verstehcn. Hierbei sind zwei bevorzugte 
Vektorcn der pKK232-8 und dcr PCM7 \fektor. Im einzelnen sind folgende Prornotoren gemeint: lad, lacZ, T3, 17, gpt, 
lambda P R , trc, CMV ? HS V Thymidin-Kinase, S V40 f LTRs aus Retrovirus und Maus Metallothionein-I. 

Die auf der Exprcssionskassette befindlichen DNA-Sequenzen kdnneo ein Fusionsprotein kodieren, das ein bekanntes 5 
Protein und ein biologisch aktivcs Polypcptid-Fragment umfaBL 

Die Expressionskassetten sind ebenfalls Gcgcnstand der vorLiegenden Erfindung. 

Die erfindungsgemaBen Nukleinsaure-Fragmente konnen zur Herstellung von \bllangen-Genen verwendet werden. 
Die erhaltlichen Gene sind ebenfalls Gegenstand der vorliegenden Erfindung. 

Die Erfindung betrifll auch die Verwendung der erfindungsgemaBen Nukleinsaure-Sequenzen, sowie die aus der \fer- 10 
wendung erhaltlichen Gen-Fragmente. 

Die erfindungsgemaBen Nukleinsaure-Sequenzen konnen mit geeigneten Vektorcn in Wirtszelien gebracht werden, in 
denen als heterologer Teil die auf den Nuklcinsaure-Fragmcnten enlhaltene genetiscben Information befindet, die expri- 
miert wird. 

Die die Nukleinsaure-Fragmente enthaltenden Wirtszelien sind ebenfalls Gegenstand der vorliegenden Erfindung. is 

Geeignete Wirtszelien sind z. B. prokaryontische Zellsysteme wie E. coli oder eukaryontische Zellsysteme wie tieri- 
sche oder humane Zellcn oder Hefen. 

Die erfindungsgemaBen Nukleinsaure-Sequenzen konnen in sense oder antisense Form verwendet werden. 

Die Herstellung der Polypeptide oder deren Fragment erfolgt durch Kulti viemng der Wirtszelien gemaB gangiger Kul- 
tivierungsmethoden und anschlieBender Isolierung und Aufreinigung der Peptide bzw. Fragmente, ebenfalls mittels gan- 20 
giger Verfahren. Die Erfindung betrifft femer Nukleinsaure-Sequenzen, die mindestens eine Teil sequenz eines biolo- 
gisch aktiven Polypeptids kodieren. 

Fcrner betrifft die vorliegende Erfindung Polypeptid-Tbilsequenzen, sogenannte ORF (open-reading-frame)-Peptide, 
gemaB den SequenzprotokoUen ORF ID Nos. 51-106. 

Die Erfindung betrifft femer die Polypeptid-Sequenzen, die mindestens eine 80%ige Homologie, insbesondere eine 25 
90%ige Homologie zu den erfindungsgemaBen Polypcptid-Icilsequcnzcn dcr ORF. ID Nos. 51-106 aufweisen. 

Die Erfindung betrifll auch Antikorper, die gegen ein Polypeptid oder Fragment davon gerichtete sind, welche von den 
erfindungsgemaBen Nukleinsauren der Sequenzcn Seq. ID No. 1 bis Seq. ID 50 kodiert werden. 

Unter Antikorper sind insbesondere monoklonalc und Phagc-Display- Antikorper zu verstehen. 

Die erfindungsgemaBen Polypeptide der Sequenzen ORF ID Nos. 5 1-106 konnen auch als Tool zum Auffinden von 30 
Wirkstoffen gegen den Blasentumor verwendet werden, was ebenfalls Gegenstand der vorliegenden Erfindung ist. 

Ebenfalls Gegenstand der vorliegenden Erfindung ist die Verwendung der Nukleinsaure-Sequenzen gemaB den Se- 
quenzen Seq. ID No. 1-50 zur Expression von Polypeptiden, die als Tools zum Auffinden von Wirkstoffen gegen den 
Blasentumor verwendet werden konnen. 

Die Erfindung betrifft auch die Verwendung der gefundenen Polypepud-lcilsequenzen ORF. ID No. 51-106 als Arz- 35 
neimittel in dcr Gcntherapie zur Behandlung gegen den Blasentumor, bzw. zur Herstellung eines Arzneimittcls zur Bc- 
handlung gegen den Blasentumor. 

Die Erfindung betrifll auch Arzneimittcl, die mindestens eine Poiypeptid-Teilsequenz ORF. ID No. 5 1-1 OS enthalten. 

Die gefundenen erfindungsgemaBen Nukleinsaure-Sequenzen konnen auch genomische oder rnRNA-Sequenzen sein. 

Die Erfindung betrifft auch genomische Gene, ihre Exon- und Intronstruktur und deren SpleiBvarianten, erhaltlich aus 40 
den cDNAs der Sequenzen Seq. ID No. 1-50, sowie deren Verwendung zusammen mit geeigneten regulativen Elemen- 
ten, wie geeigneten Prornotoren und/oder Enhancem. 

Mit den erfindungsgemaBen Nukleinsauren (cDNA-Sequenzen) Seq. ID No. 1-50 werden genomische BAC-, PAC- 
und Cosmid-Bibliotheken gescreent und fiber komplementare Basenpaarung (Hybridisierung) spezifisch humane Klone 
isoliert. Die so isolierten BAG-, PAC- und Cosmid-Klone werden mit Hilfe der Fluoreszenz-in-situ-Hybridisation auf 45 
Metaphasenchromosomcn hybridisiert und entsprechende Chromosomenabschnitte identifiziert, auf denen die entspre- 
chenden genomischen Gene liegen. BAC-, PAC- und Cosmid-Klone werden sequenziert, urn die entsprechenden genom- 
ischen Gene in ihrer vollstandigen Struktur (Prornotoren, Enhancer, Silencer, Exons und Introns) aufzuklaren. BAG-, 
PAC- und Cosmid-Klone konnen als eigenstandige Molekiile fur den Genlransfer eingesetzt werden (s. Fig. 5). 

Die Erfindung betrifft auch BAC-, PAC- und Cosmid-Klone, enthaltend funktionelle Gene und ihre cbromosoraale 50 
Lokalisation, entsprechend den Sequenzen Seq. ID. No. 1 bis Seq. ID No. 50, zur Verwendung als Vehikel zum Gentrans- 
fer. 

Bedeutungen von Fachbegriffen und Abkiirzungen 

Nukleinsauren = Unter Nukleinsauren sind in der vorliegenden Erfindung zu verstehen: mRNA, partielle cDNA, vollan- 
gen cDNA und genomische Gene (Chromosomen). 

ORF = Open Reading Frame, eine definierte Abfolge von Aminosaurcn, die von der cDNA-Sequenz abgeleitet werden 
kann. 

Contig =. cine Menge von DNA-Sequenzen, die aufgrund schr groBer Ahnlichkeiten zu einer Sequenz zusammengefaBt 60 
werden konnen (Consensus) 

Singleton = ein Conug, der nur eine Sequenz cnlhaK. 

Erklarung zu den Alignmemparamctcm 



minimal initial match = minimaJcr anfanglicher Idenlitatsbcrcicb 
maximum pads per read = maximalc Anzahl von Insertioncn 
maximum percent mismatch = maximale Abwcichung in %. 
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Erklarung dcr Abbildungen 

. Fig, 1 zeigt die syslcmalische Gen-Sucbe in der Incyle IifcScq Datenbank. 
Fig. 2a zeigt das Prinzip der EST-Assemblierung 
Fig. 2bl-2b4 zeigt das gesamte Prinzip der EST-Assemblierung 
Fig. 3 zeigt die in silico Subtraktion dcr Genexpression in verschiedenen Geweben 
Fig. 4a zeigt die Bcstimmung der gewebsspezifischen Expression iibcr elektronischen Northern. 
Fig. 4b zeigt den elektronischen Northern 

Fig. 5 zeigt die Isolierung von genomischcn BAC- und PAC-Klonen. 

Die nachfolgendcn Beispicle erliiutcrn die HerstelluDg der erfindungsgcmaBen Nukleinsaure-Sequenzen, ohne die Er- 
findung auf diese Beispiele und Nukieinsaure-Sequenzen zu beschranken. 

. Bcispiel 1 

Sucbe nach Tumor-bezogencn Kandidatengenen 

Zuerst wurden samtliche ESTs des eDtsprechendeo Gewebes aus der IifeSeq-Datenbank (vora Oktober 1997) extra- 
hiert Diese wurden dann mittels des Programms GAP4 des Staden-Pakets mil den Parameters 0% mismatch, 8 pads per 
read und einem minimalen match von 20 assembliert Die nicht in die GAP4-Datenbank aufgenommenen Scquenzen 
(Fails) wurden erst bei 1 % mismatch und dann nochmals bci 2% mismatch mit der Datenbank assembliert Aus den Con- 
tigs der Datenbank, die aus mehr als einer Sequenz bestanden, wurden Consensussequenzen errechnet Die Singletons 
der Datenbank, die nur aus einer Sequenz bestanden, wurden mit den nicht in die GAP4-Datenbank aufgenommenen Se- 
quenzen bei 2% mismatch emeut assembliert. Wiedcrum wurden fur die Contigs die Consensussequenzen crmittelt Alle 
ubrigen ESTs wurden bei 4% mismatch emeut assembliert Die Consensussequenzen wurden abermals extrahiert und 
mit den vorherigen Consensussequenzen sowie den Singletons und den nicht in die Datenbank aufgenommenen Sequen- 
zen abschlicBcnd bei 4% mismatch assembliert Die Consensussequenzen wurden gebildet und mit den Singletons und 
Fails als Ausgangsbasis fur die Gewebsvergleiche verwendet Durch diese Prozedur konnte sichergesteUt werden, daB 
untcr den verwendeten Parametern samtliche Sequenzen von einander unabhangige Genbcreicbedarstellten. 

Fig. 2bl-2b4 veranschaulicht die Vcrlangerung der Blasengcwebe ESTs. 

Die so assemblierten Sequenzen der jeweiligen Gewebe wurden anschlieBend mittels des gleichen Programms mitein- 
ander verghchen (Fig. 3). Hierzu wurden erst alle Sequenzen des ersten Gewebes in die Datenbank eingegeben. (Daher 
war es wichtig, daB diese voneinander unabhangig waren.) 

Dann wurden alle Sequenzen des zwciten Gewebes mit alien des ersten vcrglichen. Das Ergebnis warcn Sequenzen, 
die fur das crste bzw. das zweite Gewebe spezifisch waren, sowie wclche, die in beiden vorkamen. Bei Letzteren wurde 
das Verhaltnis der Haufigkeit des Vorkommens in den jeweiligen Geweben ausgewcrtet Samtliche, die Auswertung der 
assemblierten Scquenzen betrefiFenden Programme, wurden selbst entwickelt. 

Alle Sequenzen, die mehr als viermal in jeweils einem der verglicheoen Gewebe vorkamen, sowie alle, die mindestens 
funfmal so haufig in einem der beiden Gewebe vorkamen wurden writer unlersucht. Diese Sequenzen wurden einem 
elektronischen Northern (s. Beispiel 2.1) untcrzogen, wodurch die Verteilung in samtlichen Tumor- und Normal-Gewe- 
ben untersucht wurde (s. Fig. 4a und Fig. 4b). Die relevanten Kandidaten wurden dann mit Hilfe samtlicher Incyte ESTs 
und alien ESTs offentlicber Datenbanken verlangert (s. Beispiel 3). AnschlieBend wurden die Sequenzen und ihre Ober- 
setzung in mogliche Proteine mit alien Nukleolid- und Proteindatenbanken verghchen, sowie auf mogliche, fur Proteine 
kodierende Regionen untersucht. 

Beispiel 2 

Algorithmus zur Identifikation und Verlangerung von partiellen cDNA-Sequenzen mit verandertem Expressionsmuster 

^ Im folgenden soU ein Algorithmus zur Auffindung uber- oder unterexprimierter Gene erlautert werden. Die einzelnen 
Schritte sind der besseren Ubersicht halber auch in einem FluBdiagramm zusammengefaBt (s. Fig. 4b). 

2. 1 Elektronischcr Northem-Blot 

Zu einer partiellen DNA-Sequenz S, z. B. einem einzelnen EST oder einem Cbntig von ESTs, werden mittels eines 
Standardprogramms zur Homolgiesuche, z. B. BLAST (Altschul, S. F., Gish W., Miller, W., Myers, E. W und Lipman 
D. J. (1990) J. Mol. Biol., 215, 403-410), BLAST2 (Altscbul, S. F, Madden, T. L., Schaffer, A. A., Zhang, J , Zhang Z.' 
MiUer, W. und Lipman, D. J. (1997) Nucleic Acids Research 25 3389-3402) oder FASTA (Pearson, W. R. und lipman' 
D. J. (1988) Proc. Natl. Acad. Sci. USA 85 2444-2448), die horaologen Sequenzen in verschiedenen nach Geweben ge^ 
ordncten (privatcn oder SfTenUichcn) EST-Bibliotheken bestimmt. Die dadurch ermitlellcn (rclaUvcn oder absoluten) 
Gewebe-spczifischen Vorkommenshaufigkciten dieser Partial-Sequenz S werden als elektronischcr Northem-Blot be- 
zeichnet 

2.1.1 

Analog der untcr 2. 1 beschriebenen Verfahrensweise wurde die Sequenz Scq. ID No. 1 6 gefunden, die 1 7,7.x starker 
im normalen Blasenlumorgewebe als im nonnalem Blasengewebc vorkommt 
Das Ergebnis ist wie folgt: 
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Elektroniscber Northern fur SEQ. ID. NO: 16 





NORMAL 


TUMOR 


Verbaeltnisse 




%Haeufigkeit %Haeufigkeit N/T 


T/N 


Blase 




U . UD jU 


0.0565 17.6998 


Bnist 


U. UUUU 


U. UUUU 


unaer 


undef 


VUgIUIUuUU 


U- UUJ1 


n nnnn 
U. UUUU 


undef 


n nnnn 
U. UUUU 




U. UJLOU 


n nnio 
U.Uu/D 




n a i a i 




n nnnn 
U . UUUU 


n nnnn 
0. 0000 


undef 


undef 


~\ +~ A 1 »"\ <»\ f> #^ "C ^ 


n i\f\f\r\ 
U.0000 


0.0000 


undef 


undef 




U. UUUU 


n nnnn 
U . UUUU 


undef 


undef 


naeinabupgQvlSCn 


n nnnn 
U . UUUU 


ft nnnn 
U. UUUU 


undef 


under. 


OdUt 


ft nnnn 


ft nnnn 
U. UUUU 


undef 


undef 




u • UUUU 


ft nnnn 
U* UUUU 


undef 


undef 


Herz 


n nnnn 

U • V\J\JU 


n nnnn 
U . UUUU 


undef 


undef 


Hoden 


n nnnn 


n ftftftft 

u • UUUU 


undef 


undef 


jjunge 


0 0000 


0.0000 


undef 


undef 


Macicn — Sd©3. s e iroeh t*» 


0 0000 

V 9 vvvv 


o nnnn 

u • uuuu 


undef 


undef 


Muskel-Skelett 


o nnnn 

U . UUUU 


ft ftftftft 

V • UUUU 


undef* 


undef 


Niere 


ft flftftft 
U. UUUU 


ft nnnn 
U .UUUU 


undef 


undef 


Pankireas 


ft nnnn 


ft ftftftft 
u ■ MvW 


undef 


undef 




ft nnnn 

u - UUUU 


n nnnn 
U • UUUU 


undef 


undef 




0.0065 


0.0064 


1.0236 0.9769 


\J CtllUUUUS UfJLUEu 


0. 0000 


0.0000 


undef 


undef 


Uterus_Myometrium 


0.0000 


0.0000 


undef 


undef 


0 1 e ru s_a 1 1 g emein 


0.0000 


o.opco 


undef 


undef 


Bruat-flyperplasie 


0.0000 






Prostata-Hyperplasie 


0.0178 








Saroenblase 


0.0000 








Sinnesorgane 


0.0000 








We is se_Blu t koerper chen 


0.0000 








Zervix 


0.0000 










FOETUS 










%Haeufigkeit 








Entwicklung 


0.0000 








Gastrointenstinal 


0.0028 








Gehirn 


0.0000 








Haema t opoe ti sen 


0. 0000 








Haut 


0.0000 








Hepatisch 


0.0000 








Herz-Blutgefaesse 


0.0000 








Lunge 


0.0000 








Nabenniere 


0.0000 








Niere 


0.0000 








Placenta 


0.0000 








Prostata 


0.0000 








Sinnesorgane 


0.0000 









NORMIERTE/ SUBTRAHIEBTE BIBLIOTHEKEN 

%Haeufigkeit 

Brust 0.8000 

Eierstock n 0.0000 

Eierstock~t 0.0000 

Endokrines Gewebe 0.0000 

"~Foetal 0.0035 

.Gastrointestinal 0.0000 

. Haematopoetisch 0.0000 

Haut-Muskel 0.0000 

Hoden 0.0000 

Lunge 0.0000 

Nerven 0.0000 

Prostata 0.0068 

Sinnesorgane 0.0000 

Oterus n 0.0000 



In analogcr Vcrfahrensweise wurden auch folgendc Northerns gefunderi: 
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Elektronischer Northern fur SEQ. ID. NO: 8 



10 



15 



20 



25 



30 



35 



40 



45 



50 



5S 



60 



Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines Gewebe 
Gastrointestinal 
Gehirn 
Ha proa t opoe t i s ch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Hiere 
Pankreas 
Penis 
Prostata 
Uteru3__Endometrium 
Uterus_Myametrium 
Uteras^allgemein 
Brust-Byperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weis se_Blutkoerperchen 
Zervix 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebennicre 
Niere 
Placenta 
Prostata 
Sinnesorgane 



Brust 
Eierstock_a 
Eierstock_t 
Endokrines_Geviebe 
Foetal 

* Gastrointestinal 

* Haematopoetisch 

Haut-Muskel 



TUMOR 

%Haeufigkeit 

0.0281 

0.0056 

0.0000 

0.0078 

0.0000 

0.0000 

0.0010 

0.0000 

0.0000 

0.0323 

0.0000 

0.0585 

0.0123 

0.0000 

0.0120 

0.0274 

0.0U0 

0.0000 

0.0021 

0.00C0 

0.CC00 • - 

0.0000 



Verhaeltnisse 
N/T T/N 
0.0000 undef 
0.6805 1.4694 
undef 0.0000 
0.0000 undef 
undef undef 
undef 0.0000 
0.0000 undef 
.undef 0.0000 
undef 0.0000 
0.0000 undef 
undef undef 
0.0000 undef 
1.1854 0.8436 
undef undef 
0.9994 1.0006 
0.0000 undef 
0.5983 1.6714 
undef undef 
1.0236 0.9769 
undef undef: " 
uuidef under 
undef undef 



NORMAL 

%Haeufigkeit 

0.0000 

0.0038 

0.0399 

0.0000 

0.0000 

0.0479 

0.0000 

0.0227 

0.0037 

0.0000 

0.0000 

0.0000 

0.0145 

0.0000 

0.0120 

0.0000 

0.0066 

0.0000 

0.0022 

0.0000 

0.0000 

0.0000 

0.0128 

0.0030 

0.0000 

0.0000 

0.0009 

0.0000 



FOETUS 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0039 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 



NORMIERTE /SUBTRAHIERTE BIBLI0THEKEN 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 



Hoden 0."O000 
Lunge 0.0000 



Nerven 
Prostata 
Sinnesorgane 
Uterus n 



0.0000 
0.0000 
0.0000 
0.0000 



65 



6 



DE 198 18 619 A 1 

Elektronischer Northern fiir SEQ. ID. NO: 2 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T 


T/N 


Blase 


0.0000 


0.0307 


0. 


0000 


undef 


Brust 


0.0307 


0.0376 


- 0. 


8166 


1.2245 


. Duenndarm 


0.0337 


0.0165 


2. 


0391 


0.4904 


Eierstock 


0.0120 


0.0364 


0. 


3289 


3.0402 


Endokrines__Gewebe 


0.0255 


0.0075 


3- 


3962 


0.2944 


Ga s t roin t es t ina 1 


0.0153 


0.0185 


0. 


8283 


1.2072 


Gehirn 


0.0185 


0.0216 


•0. 


8571 


1.1667 


Haematopoetisch 


0.0201 


0.0379 


0. 


5293 


1.8892 


Haut 


0.0844 


0.0000 


undef 


0.0000 


Hepatisch 


0.0238 


0.0065 


3. 


6765 


0.2720 


Herz 


0.0148 


0.0000 


undef 


0.0000 


Hoden 


0.0575 


0.0351 


1. 


6399 


0.6098 


Lunge 


0.0145 


0.0082 


1. 


7781 


0.5624 


Mag en-Spe i s e roehre 


0.0387 


0.0077 


.5- 


0421 


0.1983 


' Muskel-Skelett 


0.0308 


0.0300 


1. 


0280 


0.9728 


Niere 


0.0217 


0.0000 


undef 


0.0000 


Pankreas 


0.0099 


0.0110 


0- 


8974 


1.1143 


Penis 


0.0240 


0.0000 


undef 


0.0000 


Prostata 


0.0262 


0.0213 


1. 


2284 


0.8141 


Uterus_ Endometrium 


0.0135 


0.0000 


undef 


0.0000 


U t e rus_Myome t r ium 


0.0152 


0.0408 


0. 


3741 


2.6732 


0 te rus_al 1 geme in 


0.0204 


0.0954 


0. 


2135 


4.6639 


Brust-Hyperplasie 


0.0512 











Prostata-Hyperplasie 0.0268 

Saznenblase 0.0089 

Sinnesorgane 0.0235 

Weis3e_Blutkoerperchen 0.0286 

Zervix 0.0106 



10 



IS 



20 



25 



30 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit- 

0.0278 

0.0305 

0.0063 

0.0157 , 

0.0000 

0.0000 

0.0213 

0.0289 

0.0000 

0.0185 

0.0121 

0.0000 

0.0126. 



35 



45 



Brust 
Eierstock_n 
Eierstock^t 
Endokr ine s_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 0.0100 
Prostata 0.0137 
Sinnesorgane 0.0000 
Uterus n 0.0125 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
0.0204 
0.0000 
0.0051 
0.0000 
0.0122 
0.0488 
0.0000 
0.0000 
0.0463 
0.0164 



50 



55 



60 



65 



7 
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Elektronischer Northern fiir SEQ. ID, NO: 3 



10 



15 



20 



25 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


SHaeufigkeit N/T 


T/N 


Blase 


0.0000 


0.0256 


0.0000 undef 


Brust 


0.0000 


0.0000 


undef 


undef 


Duenndarm 


0.0000 


0.0000 


undef 


undef 


Eierstock 


0.0000 


0.0000 


undef 


undef 


Endokrines_Gewebe 


0.0000 


0.0000 


undef 


undef 


Gastrointestinal 


0.0000 


0.0000 


undef 


undef 


Gehirn 


0.0000 


0.0000 


undef 


undef 


Haematopoetisch 


0.0000 


0.0000 


undef 


undef 


Haut 


0.0000 


0.0000 


undef 


undef 


Hepatisch 


0.0000 


0.0000 


undef 


undef 


Herz 


0.0000 


0.0000 


undef 


undef 


Hoden 


0.0000 


0.0000 


undef 


undef 


Lunge 


0.0000 


0.0000 


undef 


undef 


Magen-Speiseroehxe 


0.0000 


0.0000 


undef 


undef 


' Muskel-Skelett 


0.0000 


0.0000 


undef 


undef 


Niere 


0.0000 


0.0000 


undef 


undef 


Pankreas 


0.0000 


0.0000 


undef 


undef 


Penis 


0.0030 


0.0000 


undef 


0.0000 


Prostata 


0.0000 


0.0000 


undef 


undef 


0 t e rus_Endome t r i urn 


0.0000 


0.0000 


undef 


undef 


UterusJMyometrium 


0.0000 


0.0000 


undef 


undef 


Uterus_allgemein 


0.0000 


0.0000 


undef 


undef 


Brus t-Hype rpl a s i e 


0.0000 








rostata-Hyperplasie 


0.0000 








Samenblase 


0.0000 








S inne sorgane 


0.0000 









Weisse^Blutkoerperchen 0.0000 
Zervix 0.0000 



FOETUS 

35 %Haeufigkeit 

Entwicklung 0.0000 

Gastrointenstinal 0-0000 

Gehirn 0.0000 

Haematopoetisch 0.0000 

40 Haut 0.0000 

Hepatisch 0.0000 

Herz-Blutgefaesse 0.0000 

Lunge 0.0000 

Nebenniere 0.0000 

45 Niere 0.0000 

Placenta 0.0000 

Prostata 0.0000 

Sinnesorgane 0.0000. 



50 



55 



60 



65 



Brust 
Eierstock_n 
Eierstock^_t 
Endokrinesj Gewebe 
. Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Dterus n 



NORMIERTE/ StJBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 
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Elektronischer Northern fiir SEQ. ID. NO: 4 




NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T T/N 


Blase 


0.0000 


0.0230 


0.0000 undef 


Brust 


0.0090 


0.0094 


0.9527 1.0496 


Duenndarm 


0.0123 


0.0165 


0.7415 1.3487 


Eierstbck 


0.0150 


0.0104 


1.4391 0.6949 


Endokrines_Gewebe 


0.0068 


0.0125 


0.5434 1.8403 


Gastrointestinal 


0.0038 


. 0.0046 


0.8283 1.2072 


Gehirn 


0.0044 


0.0123 


0.3600 2.7779 


Haematopoe tis ch 


0.0187 


0.0000 


undef 0.0000 


Haut 


0.0000 


0.0000 


undef undef 


Hepatisch 


0.0000 


0.0388 


0.0000 undef 


Herz 


0.0138 


0.0000 


undef 0.0000 


Hoden 


0.0173 


0.0000 


undef 0.0000 


Lunge 


0.0093 


0.0143 


0.6532 1.5310 


Magen-Speiseroehre 


0.0097 


0.0000 


.undef 0.0000 


Muskel-Skelett 


0.0017 


0.0180 


0.0952 10.5060 


Niere 


0.0136 


0.0137 


0.9913 1.0088 


Pankreas 


0.0149 


0.0055 


2.6923 0.3714 


Penis 


0.0150 


0.0000 


undef 0.0000 


Prostata 


0.0065 


0.0064 


1.0236 0.9769 


Uteru3_Endometrium 


0.0000 


0.0000 


undef undef 


UterusJMyometrium 


0.0000 


0.0136 


0.0000 undef 


Oterusallgemein 


0.0000 


0.0000 


undef undef 



Brust-Hyperplasie 0.0032 
Pros tat a- Hype rpl as ie 0.0178 
Samenblase 0.0089 
Sinnesorgane 0.0000 
Weisse_Blutkoerperchen 0 . 0035 
Zervix 0.0000 



to 



15 



20 



25 



30 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit' 

0.0000 

0.0167 

0.0000 

0.0354 

0.0000 

0.0000 

0.0000 

0.0072 

0.0000 

0.0062 

0.0000 

0.0000 

0.0000. 



3S 



40 



45 



Brust 
Eierstock_n 
EierstocJc]_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 0.0114 
Haut-Muskel 0.0259 
Hoden 0.0077 
Lunge 0.0000 
N erven 0.0060 
Prostata 0.0068 
Sinnesorgane 0.0155 
Uterus n 0.0250 



NORMIERTE/SDBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
0.0272 
0.0000 
0.0051 
0.0000 
0.0151 
0.0000 



50 



55 



60 



65 



9 
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Elektronischcr Northern fur SEQ. ID. NO: 5 



10 



15 



20 



25 



30 



Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatiscb 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Dte rus_Endomet r ium 
U t e rus_Myome t r ium 
Uteru3__allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
WeisseJBlutkoerperchen 
Zervix 



NORMAL 
%Haeufigkeit 
0.0000 
0.0026 
0.0031 
0.0060 
0.0051 
0.0134 
0.0052 
0.0000 
0.0000 
0.0000 
0032 
0000 
0031 
0000 
0034 
0190 
0017 
0.0000 
0.0131 
0068 
0000 
0000 
0000 
0059 
0000 
0.0000 
0.0000 
0.0000 



TUMOR 

%Haeufigkeit 

0.0204 

0.0056 

0.0000 

0.0156 

0.0000 

0.0093 

0.0051 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0020 

0.0000 

0.0060 

0.0068 

0.0000 

0.0000 

0.0043 

0.0000 

0.0000 

0.0000 



Verhaeltnisse 
N/T T/N 
0 . 0000 undef 
0.4537 2.2042 
undef 0.0000 
0.3838 2.6058 
undef 0.0000 
1.4496 0.6898 
1.0079 0.9921 
undef undef 
undef undef 
undef undef 
undef 0.0000 
undef undef 
1.5241 0.6561 
undef undef 
0.5711 1.7510 
2.7756 0.3603 
undef 0.0000 
undef undef 
3.0709 0.3256 
undef 0.0000 
undef undef 
undef undef 



35 



40 



45 



Entwicklungr 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0000 

0.0083 

0.0063 

0.0079 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 



50 



55 



60 



65 



Brust 
Eierstock n 
Eierstock_t 

Endokrines_Gewebe 0.0000 

Foetal 0.0000 

Gastrointestinal 0.0000 

Haematopoetisch 0.0000 

— Haut-Muskel 0.0000 

Hoden 0.0000 

Lunge 0.0082 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
0.0000 
0.0000 
0.0051 



N erven 
Prostata 
Sinnesorgane 
Uterus n 



0.0050 
0.0137 
0.0000 
0.0000 



10 
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Elektronischer Northern fur SEQ. ID. NO: 6 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeuf igkeit 


N/T T/N 


Blase 


0.0000 


0.0204 


0.0000 undef 


Brust 


0.0038 


0.0075 


0.5104 1.9593 


Duenndarm 


0.0031 


0.0000 


undef 0.0000 


Eierstock 


0.0060 


0.0052 


1.1513 0.8686 


Endokrines_Gewebe 


0.0017 


0.0000 


undef 0-0000 


Gastrointestinal 


0.0038 


0.0000 


undef 0.0000 


Gehirn. 


0.0022 


0.0021 


1.0799 0.9260 


Haema topoe t i sch 


0.0053 


0.0000 


undef 0.0000 


Haut 


0.0000 


0.0000 


undef undef 


Hepatisch 


0.0048 


0.0000 


undef 0.0000 


flerz 


0.0042 


0.0137 


0.3084 3.2426 


Hoden 


0.0000 


0.0000 


undef undef 


Lunge 


0.0052 


0.0143 


0.3629 2.7557 


Magen-Speiseroehre 


0.0000 


0.0000 


undef undef 


Muskel-Skelett 


0.0017 


0.0060 


0.2856 3.5020 


Niere 


0.0000 


0.0000 


undef undef 


Pankreas 


0.0017 


0.0110 


0.1496 6.6857 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0000 


0.0021 


0.0000 undef 


D t e rua_Endome tr ium 


0.0135 


0.0000 


undef 0.0000 


Oterus_Myometrium 


0.0229 


0.0068 


3.366B 0.2970 


Uterus__allgemein 


0.0204 


0.0000 


undef 0.0000 


B rust-Hype rplasie 


0.0000 







Prostata-Hyperplasie 0.0000 

Samenblase 0.0000 

Sinnesorgane 0 . 0353 

Weisse_Blutkoerperchen 0.0009 

Zervix 0.0000 



Entwicklung 
Gastrointenstioal 
Gehim 
Haema topoe tisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0118 

0.0000 

0.0000 

0.0000 

0.0036 

0.0000 

0.0124 

0.0242 

0.0000 

0.0000 



NORMIERTE/SOBTRAHIERTE BIBLIOTHEKEN 
%Haeuf igkeit " 

Brust 0.0000 

Eierstockji 0.0000 

Eierstock t 0.0000 

Endokrines_GeweI>e 0.0245 

Foetal 0.0151 

Gastrointestinal 0.0000 

Haematopoetisch 0.0000 

Haut-Muskel 0.0259 

Hoden 0.0000 

Lunge 0.0164 

Nerven 0.0080 

Prostata 0.0000 

Sinnesorgane 0.0000 

Uterus n 0.0250 



11 



DE 198 18 619 A 1 

Elektronischer Northern fur SEQ. ED. NO: 7 



10 



15 



20 



25 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T T/N 


Blase 


0.0000 


0.0179 


0.0000 undef 


Brust 


0.0038 


0.0132 


. 0.2917 3.4287 


Duenndarzu 


0.0061 


0.0000 


undef 0.0000 


Eierstock 


0.0060 


0.0052 


1.1513 0.8686 


Endokrines__Gewebe 


0.0017 


0.0100 


0.1698 5.8889 


Gastrointestinal 


0.0019 


0.0370 


0.0518 19.3158 


Gehirn 


0.0096 


0.0051 1 


1.8719 0.5342 


Haematopoetisch 


0.0067 


0.0000 


undef 0.0000 


Haut 


0.0000 


0.0000 


undef undef 


Hepatisch 


0.0048 


0.0065 


0.7353 1.3600 


Herz 


0.0138 


0.0137 


1.0023 0.9977 


Hoden 


0.0288 


0.0468 


0.6150 1.6261 


Lunge 


0.0031 


0.0143 


0.2177 4.5929 


Magen- Spei s eroehr e 


0.0387 


0.0153 


2.5211 0.3967 


* Muskel-Skelett 


0.0017 


0.0000 


undef 0.0000 


Niere 


0.0054 


0.0000 


undef 0.0000 


Pankreas 


0.0050 


0.0055 


0.8974 1.1143 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0000 


0.0021 


0.0000 undef 


Uterus_Endometrium 


0.0000 


0.0000 


undef undef 


Ut e ru3_Myomet r ium 


0.0000 


0.0000 


undef undef 


D t e rus_allgeme in 


0.0102 


0.0000 


undef 0.0000 


Brust-Hyperplasie 


0.0000 







Prostata-Hyperplasie 0.0000 
Samenblase 0.0000 
Sinnesorgane 0.0000 
Weisse_Blutkoerperchen 0 . 0026 
Zervix 0.0000 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
S innes organe 



FOETUS 

%Haeufigkeit 

0.0000 

0.0028 

0.0751 

0.0079 

0.0000 

0.0000 

0.0036 

0.0036 

0.0000 

0.0000 

0.0061 

0.0499 

0.0000 



50 



55 



60 



65 



Brust 
Eierstockjn 
Eierstock^t 
Endokrines__Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 0.0114 
Haut-Muskel 0.0097 
Hoden 0.0540 
Lunge 0.0082 
Nerven 0.0201 
Prostata 0.0205 
Sinnesorgane 0.0000 
Uterus n 0.0375 



NORMIERTE/SOBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
0.0068 
0.0000 
0.0000 
0.0000 
0.0087 
0.0122 



12 



DE 198 18 619 A 1 

Elektronischer Northern fur SEQ. ID. NO: 8 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0000 


0.0179 


0.0000 undef 


Brust 


0.0013 


0.0038 


. 0.3403 2.9389 


Duenndarm 


0.0123 


0.0000 


undef 0.0000 


Eierstock 


0.0060 


0.0078 


0.7675 1.3029 


Endokrines_Gewebe 


0.0068 


0.0100 


0.6792 1.4722 


Gastrointestinal 


0.0038 


0.0093 


0.4142 2.4145 


Gehim 


0.0044 


0.0144 


0.3086 3.2409 


Haeroatopoetisch 


0.0080 


0.0000 


undef 0.0000 


Haut 


0.0037 


0.0000 


undef 0.0000 


Hepatisch 


0.0000 


0.0000 


undef undef 


Herz 


0.0021 


0.0000 


undef 0.0000 


Hoden 


0.0058 


0.0000 


undef 0.0000 


Lunge 


0.0031 


0.0082 


" 0.3810 2.6245 


Magen- Spe i s er oehr e 


0.0000 


0.0307 


0.0000 undef 


' Muskel-Skelett 


0-0051 


0.0060 


0.8567 1.1673 


Niere 


0.0000 


0 . 0000 


undef undef 


Pankreas 


0.0066 


0.0000 


undef 0.0000 


Penis 


0.0060 


0.0000 


undef 0.0000 


Prostata 


0.0087 


0.0064 


1.3648 0.7327 


Dteru3_Endametrium 


0.0000 


0.0000 


undef undef 


Uterus_Myometrium 


0.0000 


0.0068 


0.0000 undef 


0 1 erus_a 11 geme in 


0.0051 


0.0000 


undef 0.0000 


Brust-Hyperplasie 


0.0000 







Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
WeisseJBlutkoerperchen 
Zervix 



0.0059 
0.0000 
0.0000 
0.0035 
0.0000 



10 



ts 



.20 



25 



30 



Entwicklung 
Gastrointenstinal 
Gehim 
Haexnatopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge' 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0000 

0.0056 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0036 

0.0000 

0.0000 

0.0061 

0.0000 

0.0000: 



35 



40 



45 



Brust 
Eierstock n 
Eierstoc)c_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haexnatopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/SUSTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0204 

0.0000 

0.0608 

0.0000 

0.0029 

0.0244 

0.0000 

0.0097 

0.0154 

0.0000 

0.0080 

0.0000 

0.0000 

0.0000 



50 



55 



60 



65 



13 



DE 198 18 619 A 1 



Bektronischer Northern fur SEQ. ID. NO: 9 



15 



20 



25 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeuf igkeit 


%Haeufigkeit 


N/T 


T/N 


Blase 


0.0000 


0.0179 


0.0000 


undef 


Brust 


0.0064 


0.0056 


1.1342 


0.8817 


Duenndann 


0.0000 


0.0165 


0.0000 


undef 


Eierstock 


0.0030 


0.0026 


1.1513 


0.8686 


Endokrines_Gewebe 


0.0153 


0.0025 


6.1132 


0.1636 


Gastrointestinal 


0.0038 


0.0046 


0.8283 


1.2072 


Gehirn 


0.0015 


0.0051 


0.2880 


3.4724 


Haematopoetisch 


0.0067 


0.0000 


undef 


0.0000 


Haut 


0.0037 


0.0847 


0.0433 


23.0839 


Hepatisch 


0.0000 


0.0065 


0.0000 


undef 


Herz 


0.0053 


0.0000 


undef 


0.0000 


Hoden 


0.0230 


0,0234 


0.9839 


1.0163 


Lunge 


0.0042 


0.0041 


1.0161 


0.9842 


Magen-Speiseroehre 


0.0000 


0.0077 


0. 0000 


undef 


' Muskel-Skelett 


0.0034 


0.0000 


undef 


0.0000 


Niere 


0.0054 


0.0068 


0.7930 


1.2610 


Pankreas 


0.0017 


0.0000 


undef 


0.0000 


Penis 


0.0030 


0.0267 


0.1123 


8.9035 


Prostata 


0.0065 


0.0064 


1.0236 


0.9769 


0t erusJEndome t r ium 


0.0135 


0.0000 


undef 


0.0000 


Uterus_Myometrium 


0.0152 


0.0000 


undef 


0.0000 


Dt erus_a 1 Igemein 


0.0000 


0.0000 


undef 


undef 


Brust-Hyperplasie 


0.0032 








Prostata-Hyperplasie 


0.0000 








Samenblase 


0.0000 








Sinnesorgane 


0.0000 








Wei s se_Blut koe rpe rchen 


0.0043 








Zervix 


0.0000 









35 



40 



45 



Entwicklung 
Gas t rointens t inal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeuf igkeit 

0.0000 

0.0000 

0.0000 

0.0039 

0.0000 

0.0000 

0.0036 

0.0108 

0.0000 

0.0062 

0.0000 

0.0499 

0.0000 



50 



55 



60 



65 



Brust 
Eierstock_n 
EierstockM: 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
« i Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/ SUBTRAHXERTE BIBLIOTHEKEN 

% Ha euf igkeit 

0.0068 

0.0000 

0.0101 

0.0000 

0.0047 

0.0000 

0.0114 

0.0194 

0.0309 

0.0000 

0.0040 

0.0000 

0.0000 

0.0125 



14 



DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ED. NO: 10 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T 


T/N 


Blase 


0.0000 


0.0153 


0.0000 undef 


Brust 


0.0051 


0.0019 


2.7221 0.3674 


Duenndarm 


0.0000 


0.0000 


undef 


undef 


Eierstock 


0.0000 


0.0000 


undef 


undef 


Endokrines_Gewebe 


0.0000 


0.0000 


undef 


undef 


Gastrointestinal 


0.0000 


. 0.0093 


0,0000 undef 


Gehirn 


0.0000 


0.0113 


0.0000 undef 


Haematopoetisch 


0.0000 


0.0000 


undef 


undef 


Haut 


0.0000 


0.0000 


undef 


undef 


Hepatisch 


0.0000 


0.0129 


0.0000 undef 


Herz 


0.0011 


0.0000 


undef 


0.0000 


Hoden 


0.0058 


0.0000 


undef 


0.0000 


Lunge 


0.0021 


0.0000 


undef 


0.0000 


Magen-Speiseroehre 


0.0000 


0.0000 


undef 


undef 


MusJcel-Skelett 


0.0000 


0.0000 


undef 


undef 


Niere 


0.0000 


0.0000 


undef 


undef 


Pankreas 


0.0000 


0.0000 


undef 


undef 


Penis 


0.0000 


0.0000 


undef 


undef 


Prostata 


0.0000 


0.0000 


undef 


undef 


Ute rus_Endometrium 


0.0000 


0.0000 


undef 


undef 


Uterus^ Myometrium 


0.0000 


0.0000 


undef 


undef 


Dterus_allgemein 


0.0000 


0.0000 


undef 


undef 



Brust-Hyperplasie 0.0000 

Prostata-Hyperplasie 0.0000 

Samenblase 0.0000 

Sinnesorgane 0.0000 

Weisse__Blutkoerperchen 0.0000 

Zervix 0.0000 



FOETUS 

%Haeufigkeit* 
Entwicklung 0.0000 
Gastrointestinal 0.0000 
Gehirn 0.0000 
Haematopoetisch 0.0000 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgefaesse 0.0036 
Lunge 0.0000 
Nebenniere 0.0000 
Niere 0.0000 
Placenta 0.0000 
Prostata 0.0000 
Sinnesorgane 0.0000. 



NORMIERTE /SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0000 
Eierstock_n 0.0000 
EierstockfJ; 0.0000 ( - 
Endokrines_Gewebe. 0.0000 
Foetal 0.0006 
Gastrointestinal 0.0000 
Haematopoetisch 0.0000 
— Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0000 
Nerven 0.0010 
Prostata" 0.0000 
Sinnesorgane 0.0000 
Uterus n 0.0000 



15 



DE 198 18 619 A 1 



Elektronischer Northern filr SEQ. ID. NO: 11 



10 



15 



25 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T 


T/N 


Blase 


0. 


0000 


0.0153 


0.0000 


undef 


Brust 


0. 


0038 


0.0038 


1.0208 


0.9796 


Duenndarra 


0. 


0000 


0.0000 


undef 


undef 


Eierstock 


0. 


0000' 


0.0104 


0.0000 


undef 


Endokrines_Gewebe 


0- 


0068 


0.0025 


2.7170 


0.3681 


Gastrointestinal 


0. 


0077 


.. 0.0093 


0.8283 


1.2072 


Gehirn 


0. 


0000 


0*0062 


0.0000 


undef 


Haematopoetisch 


0. 


0107 


0.0000 


undef 


0.0000 


Uaut 


0. 


0073 


0.0000 


undef 


0.0000 


Hepatisch 


0. 


0000 


0.0194 


0.0000 


undef 


Herz 


0, 


0021 


0.0000 


undef 


0.0000 


Hoden 


0. 


0000 


0.0000 


undef 


undef 


Lunge 


0. 


0010 


0.0041 


0.2540 


3.9367 


Magen-Speiseroehre 


0. 


0097 


0.0077 


.1.2605 


0.7933 


Muskel-Skelett 


0. 


0000 


0.0000 


undef 


undef 


Niere 


0. 


0054 


0.0068 


0.7930 


1.2610 


Pankreas 


0. 


0050 


0.0000 


undef 


0.0000 


Penis 


0. 


0000 


0.0267 


0.0000 


undef 


Prostata 


0. 


0065 


0.0298 


0.2193 


4.5590 


UterusJEndometrium 


0. 


0068 


0.0528 


0.1280 


7.8106 


Dterus_Myometrium 


0. 


0076 


0.0068 


1.1223 


0.8911 


Uterus_allgemein 


0. 


0000 


0,0000 


undef 


undef 


Brust-Hyperplasie 


0. 


0000 








costata-Hyperplasie 


0. 


0238 








Samenblase 


0. 


0000 








Sinnesorgane 


0. 


0000 









Weisse_ Blutkoerperchen 0.0026 
Zervix 0.0106 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
. Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeuf igkeit* 

0.0139 

0.0194 

0.0063 

0.0000 

0.0000 

0.0260 

0.0071 

0.0108 

0.0000 

0.0000 

0.0121 

0.0499 

0.0000. 



50 



5S 



60 



65 



Brust 
Eierstock^n 
Eierstock^t 
Endokrines^Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0000 

0.0000 

0.0557 :; 

0.0000 

0.0076 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0030 

0.0137 

0.0387 

0.0042 



16 



DE 198 18 619 A 1 



Elcktronischer Northern fur SEQ. ID. NO: 12 





NORMAL 


TUMOR 


Ve rhaeltni s s e 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0. 0000 


0.0153 


0.0000 undef 


Brust 


0.0013 


0.0000 


undef 0.0000 


Duenndann 


0.0031 


0.0000 


undef 0.0000 


Eierstock 


0.0000 


0.0026 


0.0000 undef 


Endokrines Gewebe 


0.0017 


0.0075 


0.2264 4.4166 


Gastrointestinal 


0.0000 


0.0093 


0.0000 undef 


Gehirn 


0.0044 


0.0000 


undef 0.0000 


Haematopoetisch 


0,0000 


0.0000 


undef undef 


Haut 


0.0037 


0.0000 


undef 0.0000 


Hepatisch 


0.0000 


0.0000 


undef undef 


Herz 


0.0000 


0.0000 


undef undef 


Hoden 


0.0000 


0.0117 


0.0000 undef 


Lunge 


0.0000 


0.0000 


undef undef 


Magen-Speiseroehre 


0.0000 


0.0000 


•undef undef 


Muskel-Skelett 


0.0000 


0.0000 


undef undef 


Niere 


0.0000 


0.0000 


undef undef 


Pankreas 


0.0017 


0.0000 


undef 0.0000 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0000 


0.0021 


0.0000 undef 


Uterus^JEndometrium 


0.0000 


0.0000 


undef undef 


U t e rus_Myome t rium 


0.0000 


0.0000 


undef undef 


D te rus_al lgemein 


0.0000 


0.0000 


undef undef 


Brus t-Hype rpl as ie 


0.0032 







Prostata-Hyperplasie 0.0000 

Samenblase 0.0000 

Sinnesorgane 0,0000 

Wei sse_Blutkoerper chen 0.0000 

Zervix 0.0000 



FOETUS 

%Haeufigkeit 
Entwicklung 0.0000 
Gastrointenstinal 0.0000 
Gehirn 0.0125 
Haematopoetisch 0.0000 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgefaesse 0.0000 
Lunge 0.0036 
Nebenniere 0.0000 
Niere 0.0000 
Placenta 0.0000 
Prostata 0.0000 
Sinnesorgane 0.000O: 



NORMIERTE/SOBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0000 
Eierstock_n 0.0000 
EierstockJ; 0.0000 i; 
Endokrines_Gewebe 0.0245 
Foetal 0.0023 
Gastrointestinal 0.0000 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0082 
Nerven 0.0020 
Prostata 0.0000 
Sinnesorgane 0.0000 
Oterus n 0.0000 



17 



DE 198 18 619 A 1 . 

Elektronischer Northern fur SEQ. ID. NO: 13 



10 



is 



20 



25 



30 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0000 


0.0153 


0.0000 undef 


Brust 


0.0090 


0.0169 


0.5293 1.8893 


Duenndarm 


0.0031 


0.0000 


undef 0.0000 


Eierstock 


0.0060 


0.0052 


1.1513 0.8686 


Endokrines_Gewebe 


0.0051 


0.0226 


0.2264 4.4166 


Gastrointestinal 


0.0057 


0.0231 


0.2485 4.0241 


Gehirn 


0.0052 


0.0082 


0.6300 1.5874 


Haema topoe t i s ch 


0.0107 


0.0000 


undef 0.0000 


Haut 


0.0037 


0.0000 


undef 0.0000 


Hepatisch 


0.0000 


0.0000 


undef undef 


Her z 


0.0138 


0.0000 


undef 0.0000 


Hoden 


0.0345 


0.0117 


2.9518 0.3388 


Lunge 


0.0021 


0.0123 


0.1693 5.9051 


Magen-Speiseroehre 


0.0000 


0.0000 


undef undef 


Muskel-Skelett 


0.0051 


0.0060 


0.8567 1.1673 


Niere 


0.0163 


0.0068 


2.3791 0.4203 


Pankreas 


0.0000 


0.0055 


0.0000 undef 


Penis 


0.0030 


0.0267 


0.1123 8.9035 


Prostata 


0.0174 


0.0128 


1.3648 0.7327 


Uterus_Endometrium 


0.0068 


0.0000 


undef 0.0000 


Uterus_Myometrium 


0.0000 


0.0000 


undef undef 


Uterus__allgemein 


0.0051 


0.0000 


undef 0.0000 


Brust-Hyperplasie 


0.0000 







Prostata-Hyperplasie 0.0149 

Samenblase 0.0267 

Sinnesorgane 0.0118 

Weiss e_Blutkoerperchen 0.0087 

Zervix 0.0000 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haema topoet is ch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nabenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETDS 

%Haeufigkeit* 

0.0139 

0.0139 

0.0063 

0.0079 

0.0000 

0.0000 

0.0107 

0.0072 

0.0000 

0.0000 

0.0061 

0.0249 

0.0000. 



50 



55 



60 



65 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0000 
0.0000 
0.0101 
0.0000 
0.0146 
0.0000 



Eierstock_n 
Eierstock]_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haexnatopoetisch 0.0057 
Haut-Muskel 0.0000 
Hoden 0.0309 
Lunge 0.0000 
Nerven 0.0010 
Prostata 0.0137 
Sinnesorgane 0.0000 
Oterus n 0.0083 
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DE 198 18 619 A 1 

Elektronischer Northern fur SEQ. ID. NO: 14 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit 


N/T T/N 


Blase 


0.0000 


0,0153 


0.0000 undef 


Brust 


0.0051 


0.0395 


0.1296 7.7146 


Duenndann 


0. 0000 


6.0000 


undef undef 


Eierstock 


0.0060 


0.0234 


0.2558 3-9088 


Endokrines__Gewebe 


0. 0000 


0.0000 


undef undef 


Gastrointestinal 


0.0115 


0.0000 


undef 0.0000 


. Gehim 


0.0000 


0.0041 


0.0000 undef 


Haematopoetisch 


0. 0027 


0.0000 


undef 0.0000 


Haut 


0. 0037 


0.0000 


undef 0.0000 


Hepatisch 


0.0381 


0.0129 


2. 9412 0.3400 


Herz 


n nftii 
0. UU21 


a nnftn 
O.UUUU 


undef 0.0000 


Hoden 


0. 0000 


0.0234 


0. 0000 undef 


Lunge 


0.0073 


0.0245 


0.2964 3.3743 


Magen-Speiseroehre 


0. 0000 


0.0537 


0.0000 undef 


Muskel-Skelett 


0.0137 


0.0420 


0.3263 3.0643 


Niere 


0.0163 


0.0000 


undef 0.0000 


Pankreas 


0. 0017 


0.0331 


0.0499 20. 0570 


Penis 


0.0000 


0.0267 


0.0000 undef 


Prostata 


0.0022 


0.0128 


0.1706 5.8615 


Uterus_Endomet riura 


0.0068 


0.0000 


undef 0.0000 


Ote rus_Myoinet riura 


0.0305 


0.0136 


2.2445 0.4455 


Uterus_allgemein 


0.0000 


0.0000 


undef undef 


Brust-Hyperplasie 


0.0000 







Prostata-Hyperplasie 0.0000 

Samenblase 0.0000 

Sinnesorgane 0.0118 

WeisseJBlutkoerperchen 0.0000 

Zervix 0.0319 



FOETUS 

%Haeufigkeif 

Entwicklung 0.0278 

Gastrointenstinal 0 . 0583 

Gehim 0.0000 

Haematopoetisch 0.0354 

Haut 0.5025 

Hepatisch 0.0000 

Herz-Blutgefaesse 0.0071 

Lunge 0.0542 

Nebenniere 0.07 61 

Niere 0.1235 

Placenta 0.0727 

Prostata 0.0249 

Sinnesorgane 0.1255. 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0204 

Eierstock_n 0.0000 

Eierstock^t 0.0709 

EndokrinesjGewebe 0.0245 

Foetal 0.0175 

Gastrointestinal 0.0122 

Haematopoetisch 0.0000 

Haut-Muskel 0.0162 

Hoden 0.0000 

Lunge 0.0000 

Nerven 0.0000 

Prostata 0.0000 

Sinnesorgane 0.0000 

Uterus n 0.0000 
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Elcktronischer Northern fiir SEQ. ID. NO: 15 



25 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0039 


0.1278 


0.0305 32.7774 


Brust 


0.0179 


0.0414 


0.4331 2-3091 


Duenndarm 


0.0031 


0.1323 


0.0232 43.1571 


Eierstock 


0.0090 


0.0234 


0.3838 2.6058 


Endokrines_Gewebe 


0.0358 


0.0301 


1.1887 0.8413 


Gastrointestinal 


0.0019 


0.0000 


undef 0.0000 


Gehirn 


0.0007 


0.0010 


0.7200 1.3890 


Haemat opoet i sch 


0.0000 


0.0000 . 


undef undef 


Haut 


0.0037 


0.0000 


undef 0.0000 


Hepatisch 


0.0000 


0.0065 


0.0000 undef 


Herz 


0.0244 


0.0000 


undef 0.0000 


Hoden 


0.0000 


0.0000 


undef undef 


Lunge 


0.0374 


0.0491 


0.7621 1.3122 


Magen-Speiseroehre 


0.0290 


0.0077 


3.7816 0.2644 


Muskel-Skelett 


0.0000 


0.0000 


undef undef 


Niere 


0.0027 


0.0068 


0.3965 2.5219 


Pankreas 


0.0017 


0.2209 


0.0075 133.7133 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0065 


0.0085 


0.7677 1.3026 


Uterus_Endometrium 


0.0000 


0.0000 


undef undef 


Uterus_Myometrium 


0.0152 


0.0000 


undef 0.0000 


Uterus^allgemein 


0.0000 


0.0000 


undef undef 


Brust -Hype rp las ie 


0.0192 







Prostata-Hyperplasie 0.0030 

Samenblase 0.0267 

Sinnesorgane 0.0000 

Weisse_Blutkoerperchen 0.0000 

Zervix 0.0106 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haemat opoet i sch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0667 

0.0000 

o.oooo- 



50 



55 



60 



65 



Brust 
EierstocKjn 
Eierstock_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Ha emat opoet i s ch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



N ORMI ERTE / S UBTRAHI ERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.1224 

0.0000 

o.ioi3 ;; 

0.0000 
0.0268 
0.0122 
0.0057 
0.0065 
O.OOOO 
0.0246 
0.0000 
0.0205 
0.0000 
0.0125 



20 



DE 198 18 619 A 1 



Elcktronischer Northern fUr SEQ. ID. NO: 17 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0,0039 


0.0537 


0.0726 13.7665 


Brust 


0.0077 


0.0207 


0.3712 2.6940 


Duenndarm 


0-0368 


0.0000 


undef 0.0000 


Eierstock 


0.0150 


0.0078 


1.9188 0.5212 


Endokrines_Gewebe 


0.0102 


0.0100 


1.0189 0.9815 


Gastrointestinal 


0.0421 


0.0093 


4.5559 0.2195 


Gehirn 


0-0118 


0.0195 


0.6063 1.6494 


Haematopoetisch 


0.O174 


0.0379 


0.4587 2.1798 


Haut 


0.0110 


0.0000 


undef 0.0000 


Hepatisch 


0.0048 


0.0518 


0.0919 10.8799 


Herz 


0.0127 


0.0275 


0.4626 2.1618 


Hoden 


0.0115 


0.0117 


0.9839 1.0163 


Lunge 


0.0114 


0.0061 


1.8628 0.5368 


Magen-Speiseroehre 


0.0000 


0.0460 


0.0000 undef 


Muskel-Skelett 


0.0154 


0.0060 


2.5700 0.3891 


Niere 


0.O054 


0.0068 


0.7930 1.2610 


Pankreas 


0.0050 


0.0331 


0.1496 6.6857 


Penis 


0.0090 


0.0533 


0.1685 5.9357 


Prostata 


0.0174 


0.0192 


0.9099 1.0990 


Oterus_Endomet rium 


0.0068 


0.0000 


undef 0.0000 


Uterus_Myometrium 


0.0152 


0.0000 


undef 0.0000 


Uterusallgemein 


0.0204 


0.0000 


undef 0.0000 


Brust-Hyperplasie 


0.0064 






Prostata-Hyperplasie 


0.0238 






Samenblase . 


0.0000 






Sinnesorgane 


0.0000 






fleisse_Blutkoerperchen 


0.0251 






Zervix 

; 


0.0106 


* 






FOETUS 








%Haeufigkeit 






Entwicklung 


0.0000 






Gas troint ens t inal 


0.0167 






Gehirn 


0.0438 






Haematopoetisch 


0.0118 






Haut 


0.0000 






Hepatisch 


0.0000 






Herz-Blutgefaesse 


0.0107 






Lunge 


0.0181 






Nebenniere 


0.0000 






Niere 


0.0247 






Placenta 


0.0061 






Prostata 


0.0249 






Sinnesorgane 


0.0000. 







N ORMIERTE / SOBTRAHIERTE BIBLI0THEKEN 
%Haeufigkeit 
Brust 0.0408 
Eierstock,_n 0.0000 
Eierstock^t 0.0101 I 
Endokrines_Gewebe 0.0000 
Foetal 0.0087 
Gastrointestinal 0.0122 
Haematopoetisch 0.0000 
Haut-Muskel 0.0130 
Hoden 0.0Q00 
Lunge 0.0082 
Nerven 0.0010 
Prostata 0.0068 
Sinnesorgane 0.0000 
Oterus n 0.0000 
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Elektronischer Northern fur SEQ. ID. NO: 18 



Blase 
Brust 
Duenndarm 
Eiorstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haemat opoe t is ch 
Haut 
Hepatisch 
Herz 
Hod en 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Dterus_Endometrium 
Dt e rus_Myome t r ium 
0 terus_al Igemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
We i s s e_Blu t koe rperchen 
Zervix 



NORMAL 

%Haeufigkeit 

0.0195 

0.0166 

0.0061 

0.0389 

0.0392 

0.0019 

0.0007 

0.0107 

0.0220 

0.0238 

0.0085 

0.0115 

0.0104 

0.0000 

0.0600 
0.0407 
0.0198 
0.0030 
0.0000 
0.0405 
0.0305 
0.0153 
0.0064 
0.0030 
0.0000 
0.0118 
0.0000 
0.0000 



TOMOR 


Verhaeltnisse 


%Haeufigkeit N/T T/N 


0.2556 


0.0763 13.1109 


0.0357 


■ 0.46562.1477 


0.0662 


0.0927 10.7893 


0.0052 


7.4832 0.1336 


0.0326 


1.2017 0.8321 


0.0000 


undef 0.0000 


0.0606 


0.0122 81.9491 


0.0000 


undef 0.0000 


0.5085 


0.0433 23.0839 


0,0518 


0.4596 2.1760 


0.0000 


undef 0.0000 


0.0000 


undef 0.0000 


0.0041 


2.5402 0.3937 


0.0077 


0.0000 undef 


0.0480 


1.2493 0.8005 


0.0068 


5.9478 0.1681 


0.0331 


0.5983 1.6714 


0.1066 


0.0281 35.6140 


0.0021 


0.0000 undef 


0.0000 


undef 0.0000 


0.1155 


0.2641 3.7870 


0.0000 


undef 0.0000 



FOETUS - 
%Haeufigkeit 
Entwicklung 0.0696 
Gastrointenstinal 0.3332 
Gehirn 0.0000 
Haematopoetisch 0.2202 
Haut 0.0000 
Hepatisch 1.6381 
Herz-Blutgefaesse 0.0285 
Lunge 0.1337 
Nebenniere 1 . 0 9 03 
Niere 0.6301 
Placenta 0.6786 
Prostata 0.0499 
Sinnesorgane 0.0000: 



NORMIERTE/ SOBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0544 
£ierstock^_n 0.0000 
Eierstockj; 0.1063 
Endokrines_Gewebe 0.0000 
Foetal 0.4264 
Gastrointestinal 0.0000 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0000 
Nerven 0.0030 
Prostata 0.0000 
Sinnesorgane 0.0000 
Oterus_n 0.0250, 
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Elektronischer Northern fur SEQ. ED. NO: 19 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeu£igkeit 


ftHaeuf igkeit N/T T/N 


Blase 


0.0195 


0.2301 


0.0847 11.7998 


Brust 


0.0192 


0.0113 


1.7013 0.5878 


Duenndarm 


0.0061 


0.0331 


0.1854 5.3946 


Eierstock 


0*0180 


0.0000 


undef 0.0000 


Endokrines_Gewebe 


0.0034 


0.1555 


0.0219 45.6387 


Gastrointestinal 


0.0000 


0.0370 


0.0000 undef 


Gehirn 


0.0214 


0.1561 


0.1374 7.2801 


Ha ema t opoe t i s ch 


0.0134 


0.0000 


undef 0.0000 


Haut 


0.0073 


0.0000 


undef 0.0000 


Hepatisch 


0.0048 


0.0000 


undef 0.0000 


Herz 


0.0085 


0.0000 


undef 0.0000 


Hoden 


0.0000 


0.0000 


undef undef 


Lunge 


0.0197 


0.0020 


9.6527 0.1036 


Magen-Speiseroehre 


0.0000 


0.0077 


•0.0000 undef 


Muskel-Skelett 


0.0034 


0.1320 


0.0260 38.5221 


Niere 


0.0109 


0.0068 


1.5861 0.6305 


Pankreas 


0.0083 


0.0000 


undef 0.0000 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0000 


0.0021 


0.0000 undef 


Uterus_Endometrium 


0.0203 


0.0000 


undef 0.0000 


Ut erus_ Myomet r ium 


0.0000 


0.0475 


0.0000 undef 


Oterus_allgemein 


0.0153 


0.0000 


undef 0.0000 


Brust-Hyperplasie 


0.0128 






Prostata-Hyperplasie 


0.0030 






Samenblase 


0.0000 






Sinnesorgane 


0.0235 






We i s se_Blut koerpe r chen 


0.0000 






Zervix 


0.0213 








FOETUS 








%Haeufigkeit* 






Entwicklung 


0.0417 






Gastrointenstinal 


0.1361 






Gehirn 


0.0063 






Haema t opoe t i s ch 


0.1337 






Haut 


0.0000 






Hepatisch 


0.3380 






Herz-Blutgefaesse 


0.0249 






Lunge 


0.0578 






Nebenniere 


0.5071 






Niere 


0.2594 






Placenta 


0.4120 






Prostata 


0.0000 






Sinnesorgane 


0.0000 







N ORMI ERTE / SUBTRAH IERTE BIBLIOTHEKEN 
fcHaeufigkeit 

Brust 0.0612 

Eierstock_n 0.0000 

Eierstock_t 0.0000 

Endokrines_Gewebe 0.0000 

Foetai 0.1188 

Gastrointestinal 0.0244 

Haematopoetisch 0.0000 

Haut-Muskel 0.0000 

Hoden 0.0154 

Lunge 0.0000 

Nerven 0.0141 

Prostata 0.0000 

Sinnesorgane 0.0000 

Uterus n 0.0416 
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DE 198 18 619 A 1 



Elektroniscber Northern fur SEQ. ID. NO: 20 



10 



is 



25 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0039 


0.0383 


0.1017 9.8332 


Brust 


0.0077 


0.0075 


1.0208 0.9796 


Duenndann 


0.0061 


0.0496 


0.1236 8.0920 


Eierstock 


0.0000 


0.0130 


0.0000 undef 


Endokrines_Gewebe 


0.0153 


0.0000 


undef 0.0000 


Gastrointestinal 


0.0115 


0.0000 


undef 0.0000 


Gehim 


0.0022 


0.0031 


0.7200 1.3890 


Haematopoe tiscb 


0.0013 


0 T 0000 


undef 0.0000 


Haut 


0.0073 


0.0000 


undef 0.0000 


Hepatisch 


0.0048 


0.0129 


0.3676 2.7200 


Her z 


0.0000 


0.0000 


undef undef 


Hoden 


0.0000 


0.0000 


. undef undef 


Lunge 


0.0042 


0.0102 


0.4064 2.4605 


Magen-Speiseroehxe 


0.0290 


0.0077 


3-7816 0.2644 


Muskel-Skelett 


0.0000 


0.0000 


undef undef 


Niere 


0.0136 


0.0000 


undef 0.0000 


Pankreas 


0.0017 


0.0000 


undef 0.0000 


Penis 


0.0030 


0.0000 


undef 0.0000 


Prostata 


0.0065 


0.0106 


0.6142 1.6282 


0 t e rus_Endome t r ium 


0.0068 


0.0528 


0.1280 7.8106 


Uterus_Myometrium 0.0000 


0.0000 


undef undef 


U t erus_a 1 lgemein 


0.0000 


0.0000 


undef undef 


Brust-Hyperplasie 


0.0032 







Prostata-Hyperplasie 0.0268 
Saxnenblase 0.0267 
Sinnesorgane 0.0235 
Weisse_Blutkoerperchen 0.0000 
Zervix 0.0319 



FOETUS 

%Haeufigkeit* 
Entwicklung 0.0139 
Gastrointenstinal 0.0139 
Gehirn 0.0000 
Haematopoetisch 0.0039 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgefaesse 0.0000 
Lunge 0.0036 
Nebenniere 0.0254 
Niere 0.0062 
Placenta 0.0000 
Prostata 0.0000 
Sinnesorgane 0.0000. 



NORMIERTE/SOBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0204 
Eierstock_n 0.0000 
Eierstock^t 0.0203 
Endokrines_Gewebe 0.0245 ' 
Foetal 0.0128 
Gastrointestinal 0.0122 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0154 
Lunge 0.0082 
Nerven 0.0090 
Prostata 0.0068 
Sinnesorgane 0.0000 
Uterus n 0.0000 



40 



60 
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Elektronischer Northern fur SEQ. ID. NO: 21 



Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
OterusJEndometrium 
Ute rus_Myome trium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
WeisseJBlutkoerperchen 
Zervix 



NORMAL 

%Haeufigkeit 

0.0039 



0077 
0215 
0030 
0000 
0.0575 
0.0000 
0.0214 
0.0037 
0.0000 
0.0000 
0.0000 
0.0104 
0.0000 
0.0257 
0.0054 
0.0099 
0.0000 
0.0022 
0.0000 
0.0000 
0.0000 
0.0096 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 



TUMOR 

%Haeufigkeit 

0.0358 

0.0207 

0.0331 

0.007B 

0.0000 

0.0046 

0.0072 

0.0000 

0.0000 

0.0194 

0.0137 

0.0585 

0.0225 

0.0000 

0.0120 

0.0000 

0.0221 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 



Verhaeltnisse 
N/T T/N 
0.1090 9.1777 
0.3712 2.6940 
0.6488 1.5413 
0.3838 2.6058 
undef undef 
12.4251 0.0805 
0.0000 undef 
undef 0.0000 
undef 0.0000 
0.0000 undef 
0.0000 undef 
0.0000 undef 
0.4618 2.1652 
undef undef 
'2.1416 0.4669 
undef 0.0000 
0.4487 2.2286 
undef undef 
undef 0.0000 , 
undef undef 
undef undef 
undef undef 



10 



is 



20 



25 



30 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000. 



35 



40 



45 



Brust 
Eierstockjn 
Eierstock^_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
S innesorgane 
Uterus n 



NORMI ERTE / SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0136 

0.0000 

0.0051 

0.0000 

0.0000 

0.0122 

0.0000 

0.0000 

0.0000 

0.0164 . 

0.0000 - ' 

0.0000 

0.0000 

0.0000 
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25 



DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 22 



10 



15 



20 



25 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T T/N 


Blase 


0.0351 


0.3144 


0.1116 8.9591 


Brust 


0.0281 


0.0470 


0.5989 1.6698 


Duenndarm 


0.0092 


0.0662 


0.1390 7.1929 


Eierstock 


0.0569 


0.0208 


2.7342 0.3657 


Endo kr ines_Ge webe 


0.0596 


0.0527 


1.1321 0.8833 


Gastrointestinal 


0.0019 


.. 0.0139 


0.1381 7.2434 


Gehirn 


0.0022 


0.0945 


0.0235 42*5950 


Haematopoetisch 


0.0174 


0.0000 


undef 0.0000 


Haut 


0.0220 


0.5085 


0.0433 23.0839 


Hepatisch 


0.0285 


0.0582 


0.4902 2.0400 


Herz 


0.0223 


0.0000 


undef 0 . 0000 


Hoden 


0.0173 


0.0000 


undef 0.0000 


Lunge 


0.0145 


0.0082 


1.7781 0.5624 


Magen-Speiseroehre 


0.0290 


0.0077 


3.7816 0.2644 


" Muskel-Skelett 


0.0788 


0.0540 


1.4595 0.6852 


Niere 


0.0489 


0.0137 


3.5687 0.2802 


Pankreas 


0.0264 


0.0442 


0.5983 1.6714 


Penis 


0.0090 


0.1066 


0.0842 11.8713 


Prostata 


0.0000 


0.0064 


0.0000 undef 


0 te rus_Endamet r ium 


0.1013 


0.0000 


undef 0.0000 


Dterusj Myometrium 


0.0381 


0.1494 


0.2551 3.9206 


Oterus_allgemein 


0.0153 


0.0954 


0.1601 6.2452 


Brus t-Hyperplas ie 


0.0096 







Prbstata-Hyperplasie 0.0059 

Samenblase 0.0000 

Sinnesorgane 0.0118 

Weisse_Blutkoerperchen 0.0000 

Zervix 0.0000 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0696 

0.4554 

0.0000 

0.2753 

0.0000 

1.6381 

0.0605 

0.1879 

1.1663 

0.8215 

0.7816 

0.0499 

0.0000. 



50 



55 



60 



65 



Brust 
Eierstock_n 
Eierstock^_t 
Endokr ines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0612 

0.0000 

0.1164 

0.0000 

0.4665 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0030 

0.0068 

0.0000 

0.0291 
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Elektroniscber Northern fur SEQ. ID. NO: 23 





NORMAL 


TUMOR 


Verhaeltnlsse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0039 


0,0332 


0.1173 8.5221 


Brust 


0.0038 


0.0263 


. 0.1458 6.8574 


Duenndarm 


0.0184 


0.0331 


0.5561 1.7982 


Eierstock 


0.0150 


0.0156 


0.9594 1.0423 


Endokrines_Gewebe 


0.0170 


0.0075 


2.2642 0.4417 


Gastrointestinal 


0.0192 


0.0324 


0.5917 1.6901 


Gehirn 


0.0133 


0.0113 


1.1781 0.8488 


Haematopoetisch 


0.0174 


0.0000 


undef 0.0000 


Haut 


0.0220 


0.0000 


undef 0.0000 


Hepatisch 


0.0048 


0.0388 


0.1225 8.1599 


Herz 


0.0339 


0.0137 


2.4671 0.4053 


Hoden 


0.0288 


O.0234 


1.2299 0.8130 


Lunge 


0.0218 


0.0184 


1.1854 0.8436 


Magen- Spei s e r oehr e 


0.0000 


0.0307 


.0.0000 undef 


" Muskel-Skelett 


0.0291 


0.0240 


1.2136 0.8240 


Niere 


0.0136 


0.0137 


0.9913 1.0088 


Pankreas 


0.0050 


0.0166 


0.2991 3.3428 


Penis 


0.0120 


0.0000 


undef 0.0000 


Prostata 


0.0153 


0.0149 


1.0236 0.9769 


Uterus_Endometrium 


0.0203 


0.0000 


undef 0.0000 


Uterus_Myometrium 


0.0000 


0.0204 


0.0000 undef 


0teru3_allgemein 


0.0051 


0.0000 


undef 0.0000 


Brust-Hyperplasie 


0.0128 






Prostata-Hyperplasie 


0.0178 






Samenblase 


0.0267 






Sinnesorgane 


0.0118 






Wei sse_Blutkoerper chen 


0.0243 






Zervix 

* 


0.0213 








FOETUS 








%Haeufigkeit 






Entwicklung 


0.0139 






Gastrointenstinal 


0.0056 






Gehirn 


0.0125 






Haematopoetisch 


0.0000 






Haut 


0.0000 






Hepatisch 


0.0000 






Herz-Blutgefaesse 


0.0320 






Lunge 


0.0036 






Nebenniere 


0.0254 






Niere 


0.0124 






Placenta 


0.0121 






Prostata 


0.0000 






Sinnesorgane 


0.0000, 







NORMIERTE/ SOBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0204 

Eierstoc)^_n 0.0000 

EierstockM: 0.0354 

EndokrinesGewebe 0.0000 

Foetal 0.0105 

Gastrointestinal 0.0000 

Haematopoetisch 0.0171 

Haut-Muskel 0.0454 

Hoden 0.0000 

Lunge 0.0246 

Nerven 0.0211 

Prostata 0.0000 

Sinnesorgane 0.0000 

Uterus n 0.0000 
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Elektronischer Northern fiir SEQ. ID. NO: 24 





NORMAL 


TOMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0195 


0.1457 


0.1338 7.4732 


Brust 


0.0013 


0.0169 


. 0.0756 13.2250 


Duenndarm 


0.0061 


0.0496 


0.1236 8.0920 


Eierstock 


0.0090 


0.0260 


0.3454 2.8954 


Endokrines_Gewebe 


0.0119 


0.0125 


0.9509 1.0516 


Gastrointestinal 


0.1648 


0.0231 


7.1237 0.1404 


Gehirn 


0.0030 


0.0308 


0.0960 10.4173 


Haematopoetisch 


0.2099 


0.0000 


undef 0.0000 


Haut 


0.0000 


0.0000 


undef undef 


Hepatisch 


0.0000 


0.2006 


0.0000 undef 


Herz 


0.0032 


0.0137 


0.2313 4.3235 


floden 


0.0000 


0.0819 


0.0000 undef 


Lunge 


0.0956 


0.2965 


0.3223 3.1023 


Magen-Speiseroehre 


0.0097 


0.1917 


0.0504 19.8329 


' Muskel-Skelett 


0.0959 


0.0240 


3.9977 0.2501 


Niere 


0,0081 


0.0205 


0.3965 2.5219 


Pankreas 


0.0182 


0.0497 


0.3656 2.7350 


Penis 


0.0030 


0.1333 


0.0225 44.5175 


Prostata 


0.0131 


0.0064 


2.0473 0.4885 


D t e rus_Endome t r ium 


0.0000 


0.0000 


undef undef 


Oterus_Myometrium 


0.0000 


0.0000 


undef undef 


D te rus__ all gemein 


0.0051 


0.0000 


undef 0.0000 


Brust-Hyperplasie 


0.0096 







Prostata-Hyperplasie 0.0030 
Samenblase 0.0267 
Sinnesorgane 0.0706 
Weisse_Blutkoerperchen 0.0017 
• Zervix 0.0000 



FOETUS 

%Haeufigkeit 
Entwicklung 0.0000 
Gastrointenstinal 0.0000 
Gehirn 0.0000 
Haematopoetisch 0.0000 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgefaesse 0.0000 
Lunge 0.0000 
Nebenniere 0.0000 
Niere 0.0000 
Placenta 0.0000 
Prostata 0.0000 
Sinnesorgane 0.000Q, 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0068 
Eierstock_n 0.0000 
Eierstoclc^t 0.2076 
Endokrines_Gewebe 0 . 0000 
Foetal 0.0006 
Gastrointestinal 0.0000 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0082 
Nerven 0.0030 
Prostata 0.0068 
Sinnesorgane 0.0000 
Oterus n 0.0042 
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Etektroniscbcr Northern fur SEQ. ID. NO: 25 



Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speisefoehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endametrium 
U t e rus_Myame t r ium 
D t e rus_ a 1 1 gerae in 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
We is s e_Blut koerpe rchen 
Zervix 



NORMAL 

%Haeuf igkeit 

0.0039 

0.0217 

0.0123 

0.0000 

0.0102 

0.0115 

0.0148 

0.0107 

0.0220 

0.0000 

0.0085 

0.0058 

0.0145 

0.0000 

0.0086 

0.0081 

0.0050 

0.0030 

0.0109 

0.0203 

0.0076 

0.0102 

0.0096 

0.0059 

0.0089 

0.0000 

0.0286 

0.0106 



TUMOR 

%Haeufigkeit 

0.0281 

0.0132 

0.0000 

0.0026 

0.0150 

0.0046 

0.0154 

0.0000 

0.0847 

0.0000 

0.0000 

0.0117 

0.0061 

0.0077 

0.0360 

0.0000 

0.0166 

0.0000 

0.0064 

0.0000 

0.0068 

0.0000 



Verhaeltnisse 
H/T T/N 
0.1387 7.2110 
1.6527 0.6051 
undef 0.0000 
0.0000 undef 
0.6792 1.4722 
2.4850 0.4024 
0.9599 1.0417 
undef 0.0000 
0.2599 3.8473 
undef undef 
undef 0.0000 
0.4920 2.0326 
2.3708 0.4218 
0. 0000 undef 
0.2380 4.2024 
undef 0.0000 
0.2991 3.3428 
undef 0.0000 
1.7060 0.5862 
undef 0.0000 
1.1223 0.8911 
undef 0.0000 



FOETUS 

%Haeufigkeit 
Entwicklung 0.0000 
Gastrointenstinal 0.0028 
Gehirn 0.0125 
Haematopoetisch 0. 0000 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgefaesse 0.0000 
Lunge 0.0036 
Nebenniere 0.0000 
Niere 0.0000 
Placenta 0.0061 
Prostata 0.0000 
Sinnesorgane 0.0000, 



NORKTERTE/ SUBTRAHIERTE B I BL I OTHEKEN 
%Haeufigkeit 
Brust 0.0204 
Eierstockjn 0.0000 
EierstockM: 0.0051 
Endokrines_ Gewebe 0.0000 
Foetal 0.0029 
Gastrointestinal 0.0000 
Haematopoetisch 0.0000 
— Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0000 
Nervon 0.0010 
Prostata 0.0068 
Sinnesorgane 0.0000 
Uterus n 0.0000 
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Elektroniscbcr Northern fur SEQ. ID. NO: 26 



10 



15 



20 



25 





NORMAL 


TDM0R 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit 


N/T 


T/N 


Blase 


0.0039 


0.0256 


0.1525 6.5555 


Brust 


0.0038 


0.0056 


0.6805 1.4694 


Duenndarm 


0.0031 


0.0000 


undef 


0.0000 


Eierstock 


0.0210 


0.0052 


4.0294 0.2482 


Endokrines__Gewebe 


0.0017 


0.0000 


undef 


0.0000 


Gastrointestinal 


0.0057 


0.0000 


undef 


0.0000 


Gehirn 


0.0037 


0.0031 


1.1999 0.8334 


Haema t opoe t i s ch 


0.0040 


0.0000 


undef 


0.0000 


Haut 


0.0073 


0.0000 


undef 


0.0000 


Hepatisch 


0.0000 


0.0000 


undef 


undef 


Herz 


0.0000 


0.0000 


undef 


undef 


Hoden 


0.0000 


0.0000 


undef 


undef 


Lunge 


0.0073 


0.0020 


3.5562 


! 0.2812 


Magen-Speiseroehre 


0.0000 


0.0000 


undef 


undef 


Muskel-Skelett 


0.0069 


0.0000 


undef 


0.0000 


Niere 


0.0027 


0.0000 


undef 


0.0000 


Pankreas 


0.0017 


0.0000 


undef 


0.0000 


Penis 


0.0030 


0.0000 


undef 


0.0000 


Prostata 


0.0000 


0.0021 


0.0000 


undef 


Uterus_Endametriuni 


0.0000 


0.0000 


undef 


undef 


Oterus_Myometriuin 


0.0076 


0.0068 


1.1223 


0.8911 


Dterus allgemein 


0.0102 


0.0000 


undef 


O.OOOC 



Brust-Hyperpla3ie 0.0000 

Prostata-Hyperplasie 0.0030 

Samenblase 0.0089 

Sinnesorgane 0.0000 

Weiss eBlutkoerperchen 0.0121 

Zervix 0.0000 



FOETUS 

35 %Haeufigkeit 
Entwicklung 0.0000 
Gas troint ens tinal 0-0139 
Gehirn 0.0063 
Haematopoetisch 0.0039 
40 Haut 0.0000 

Hepatisch 0.0000 
Herz-Blutgefaesse 0.0036 
Lunge 0.0036 
Nehenniere 0.0000 
45 ■ Niere 0.0062 

Placenta 0.0000 
Prostata 0.0000 
Sinnesorgane 0 . 0000 , 



50 

NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0000 
Eierstock_n 0.0000 
55 Eierstock^t 0.0101 

Endokrines_Gewebe 0.0000 
Foetal 0.0029 
Gastrointestinal 0.0488 
Haematopoetisch 0.0114 
w -- Haut-Muskel 0.0130 

Hoden 0.0154 
Lunge 0.0000 
Nerven 0.0020 
Prostata 0.0274 
Sinnesorgane 0.0155 
Uterus n 0.0083 
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Eleklronischer Northern fur SEQ. ID. NO: 27 







TUMOR 


Verhaeltnisse 




vnaeurigKei t 


%Haeufigkeit N/T 


T/N 


Til 


ft nmo 


0.0256 


0.1525 


6.5555 • 


Br~USt 


n m t 
u • 


ft ft 1 Eft 

0.0150 


0.7656 


1.3062 


Duenndann 


U.UJU7 


ft ft ft ft n 
0.0000 


undef 


0.0000 


Eierstock 


n m en 


0.0052 


2.8781 


0.3474 


£>naoKriii6s bcwcDe 


0- 0119 


0.0150 


0.7925 


1.2619 


Gastrointestinal 


0,0057 


0.0139 


0.4142 


2.4145 


beriirn 


A ftft*J A 

0. 0074 


0.0092 


0.8000 


1.2501 


Haematopoetisch 


0, 0147 


0.0000 


undef 


0.0000 


naut 


U . UUo / 


0.1695 


0.0217 


46.1678 


Hepatisch 


0. 0048 


0.0000 


undef 


0.0000 


Herz 


ft ftft*) A 

0. 0074 


0.0137 


0.5397 


1.8529 


Hod en 


ft ft Aft ft 

0. 0000 


0.0234 


0.0000 


undef 


Lunge 


0.0104 


0.0082 


1.2701 


0.7873 


Magen-Speiseroehre 


0.0000 


0.0230 


jO.OOOO 


undef 


Muskel-Skelett 


0.0086 


0.0000 


undef 


0.0000 


Niere 


0.0190 


0.0000 


undef 


0.0000 


Pankreas 


0.0066 


0.0055 


1.1966 


0.8357 


Penis 


0.0150 


0.0267 


0.5616 


1.7807 


Prostata 


0.0087 


0.0149 


0.5849 


1.7096 


Uterus_Endometrium 


0.0203 


0.0000 


undef 


0.0000 


UterusJMyometrium 


0.0000 


0.0136 


0.0000 


undef 


- Uterus_allgemein 


0.0051 


0.0000 


undef 


0.0000 


Brust-Hyperplasie 


0.0064 









Prostata-Hyperplasie 0.0178 

Samenblase 0.0178 

Sinnesorgane 0.0235 

WeisseJBlutkoerperchen 0.0104 

Zervix 0.0000 



FOETUS 

%Haeufigkeit 
Entwicklung 0.0000 
Gastrointenstinal 0.0111 
Gehirn 0.0125 
Haematopoetisch 0.0000 
Haut 0.0000 
Hepatisch 0.0260 
Herz-Blutgefaesse 0.0071 
Lunge 0.0217 
Nehenniere 0.0254 
Niere 0.0062 
Placenta 0.0061 
Prostata 0.0000 
Sinnesorgane 0.0000: 



NORMIERTE/SOBTRAHIERTE BIBLIOTHEKEN 
%Haeuf igkeit 
Brust 0.0136 
Eierstockji 0.0000 
Eierstock^t 0.0101 
Endokrines_Gewebe 0.0245 
Foetal 0.0181 
Gastrointestinal 0.0244 
Haematopoetisch 0.0114 
= : Haut-Muskel 0.0065 
Hoden 0.0000 
Lunge 0.0164 
Nerven 0.0090 
Prostata 0.0068 
Sinnesorgane 0 . 0232 
Uterus n 0.0000 



31 



DE 198 18 619 A 1 



IS 



20 



25 



30 







Elektronischcr 


Northern fur SEQ 


. JD. NO: 


28 




NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


IHaeufigkeit 


N/T 


T/N 


Blase 


0, 


0312 


0.1968 


0. 1585 


6.3096 


Brust 


0. 


0281 


0.0526 


0.5347 


1.8702 


Duenndarm 


0. 


2177 


0.0000 


undef 


0.0000 


Eierstock 


0. 


0000 


0.0494 


0.0000 


undef 


Endokrines_Gewebe 


0. 


0017 


0.0000 


undef 


0.0000 


Gastrointestinal 


0. 


2778 


0.0324 


8.5792 


0.1166 


Gehirn 


0. 


0007 


0.0133 


0.0554 


18.0566 


Haematopoetisch 


0. 


1470 


0.0000 


undef 


0.0000 


Haut 


0. 


0551 


0.0000 


undef 


0.0000 


Hepatisch 


0. 


0000 


0.1876 


0.0000 


undef 


Herz 


0. 


0011 


0.0137 


0.0771 


12.9706 


Hoden 


0. 


0000 


0.2339 


0.0000 


undef 


Lunge 


0. 


0623 


0.1206 


0.5166 


1.9356 


Ha gen- Speis e r oehre 


0. 


0000 


0.0153 


0.0000 


undef 


Muskel-Skelett 


0. 


1216 


0.0300 


4.0548 


0.2466 


Niere 


0. 


0081 


0.1027 


0-0793 


12.6097 


Pankreas 


0. 


0743 


0.2209 


0.3365 


2.9714 


Penis 


0. 


0000 


0.0000 


undef 


undef 


Prostata 


0. 


0065 


0.0149 


0.4387 


2.2795 


Uterus Endometrium 


0. 


0068 


0.0000 


undef 


0.0000 


Uterus_Myometrium 


0. 


0000 


0.0000 


undef 


undef 


Uterus_allgemein 


0. 


0000 


0.0000 


undef 


undef 


Brust-Hyperplasie 


0. 


0384 








Prostata-Hyperplasie 


0. 


0030 








Samenblase 


0. 


0000 








Sinnesorgane 


0. 


0118 








We i s se_Blut koe rpe r chen 


0. 


0017 








Zervix 


0. 


0000 









35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit. 

0.0000 

0.0000 

0.0000 

0.0079 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0,0000 

0.0000 

0.0000 



50 



55 



60 



65 



Brust 
Eierstock_n 0.0000 
Eierstock^t 0.0051 
Endokrines_Gewebe 0-0000 
Foetal 0.0000 
Gastrointestinal 0-0000 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0082 
Nerven 0.0000 
Prostata 0.0000 
Sinnesorgane 0.0000 
Uterus n 0.0000 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
0.0068 
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Elektronischcr Northern fiir SEQ. ID. NO: 29 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0273 


0.1713 


0.1594 6.2745 


Brust 


0.0217 


0.0395 


0.5509 1.8152 


Duenndarm 


0.1901 


0.0000 


undef 0.0000 


Eierstock 


0.0000 


0.0390 


0.0000 undef 


Endokrines_Gewebe 


0.0017 


0.0000 


undef 0.0000 


Gastrointestinal 


0.2491 


0.0278 


8.9737 0.1114 


Gehirn 


0.0000 


0.0113 


0.0000 undef 


Haeraat opoet is ch 


0.1297 


0.0000 


undef 0.0000 


Haut 


0.0477 


0.0000 


undef 0.0000 


flepatisch 


0.0000 


0.1423 


0.0000 undef 


Herz 


0.0032 


0.0000 


undef 0.0000 


Hoden 


0.0000 


0.2105 


0.0000 undef 


Lunge 


0.0457 


0.0818 


0.5588 1.7894 


Magen-Speiseroehre 


0.0000 


0.0153 


P. 0000 undef 


Muskel-Skelett 


0.0822 


0.0240 


3.4266 0.2918 


Niere 


0.0054 


0.1027 


0.0529 18.9146 


Pankreas 


0.0694 


0.2430 


0.2855 3.5020 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0153 


0.0106 


1.4331 0.6978 


Uteru3_ Endometrium 


0.0068 


0.0000 


undef 0.0000 


Uterus_Myometrium 


0.0000 


0.0000 


undef undef 


0 1 e rus_al Igemein 


0.0000 


0.0000 


undef undef 


Brust -Hype rplasie 


0.0288 






Prostata-Hyperplasie 


0.0030 






Saznenblase 


0.0000 






Sinnesorgane 


0.0118 






Wei3se_Blutkoerperchen 


0.0009 






Zervix 


0.0000 








FOETUS 








%Haeufigkeit 






Entwicklung 


0.0000 






Gastrointenstinal 


0.0000 






Gehirn 


0.0000 






Haemat opoet is ch 


0.0079 






Haut 


0.0000 






Hepatisch 


0.0000 






Herz-Blutgefaesse 


0.0000 






Lunge 


0.0000 






Nebenniere 


0.0000 






Niere 


0.0000 






Placenta 


0.0000 






Prostata 


0.0000 






Sinnesorgane 


0.0000, 







NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0068 
Eierstockji 0.0000 
Eierstock^t 0.0000 
Endokrines_Gewebe 0.0000 
Foetal 0.0000 
Gastrointestinal 0.0000 
Haematopoetisch 0.0000 
— * Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0082 
Nerven 0,0000 
Prostata 0.0000 
Sinnesorgane 0.0000 
Uterus n 0.0000 
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DE 198 18 619 A 1 



Hektronischer Northern fur SEQ. ED. NO: 30 



10 



is 



20 



25 



30 





NORMAL 


TUMOR 


. Verhaeltnisse 


Blase 


%Haeufigkeit 


%Haeufigkeit N/T T/N 


0.0858 


0.4525 


0.1896 5.2742 


Brust 


0.1279 


0.0977 


. 1.3087 0.7641 


Duenndarm 


0.4998 


0.2150 


2.3242 0.4302 


Eierstock 


0.0180 


0.1353 


0.1328 7.5280 


Endokrines_Gewebe 


0.0307 


0.0176 


1.7466 0.5725 


Gastrointestinal 


0.7434 


0.1341 


5.5413 0.1805 


Gehirn 


0.0030 


0.0524 


0.0565. 17. 7093 


Haematopoetisch 


0.6884 


0.0000 


undef 0.0000 


Haut 


0.0551 


0.1695 


0.3249 3.0779 


Hepatisch 


0.0381 


0.7635 


0.0499 20.0598 


Herz 


0.0095 


0.0412 


0.2313 4.3235 


Hod en 


0.0000 


0.1403 


0.0000 undef 


Lunge 


0.2773 


0.5418 


0.5119 1.9536 


Magen-Speiseroehre 


0.0676 


0.3450 


0.1961 5.0999 


Muskel^Skelett 


0.3203 


0.0660 


4.8544 0.2060 


Niere 


0.0163 


0.1780 


0.0915 10.9284 


Pankreas 


0.0529 


0.2319 


0.2279 4.3875 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0828 


0.0192 


4.3220 0.2314 


0 t erus^Endamet r ium 


0.0068 


0.0000 


undef 0.0000 


Uterus_My ome t r ium 


0.0076 


0.0204 


0.3741 2.6732 


U terus^a 1 lgerae in 


0.0102 


0.0000 


undef 0.0000 


Brust-Hyperplasie 


0.1343 






Pros tat a-flyperplas ie 


0.0268 






Samenblase 


0.0267 






Sinnesorgane 


0.0353 






Weis se_Blut koe rper chen 


0.0043 






Zervix 


0.0852 







35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit- 
0.0000 
.0000 
.0000 
0118 
0000 
.0000 
,0000 
,0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 



50 



55 



60 



65 



Brust 
Eierstock_n 
Eierstock/_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
=: Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.1837 

0.0000 

0.0405 

0.0000 

0.0023 

0.0976 

0.0000 

0.0000 

0.0000 

0.0000 

0.0020 

0.0000 

0.0000 

0.0042 
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Eleklronischer Northern fur SEQ. H>. NO: 31 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T T/N 


Blase 


0.0039 


0.0204 


0.1907 5.2444 


Brust 


0.0077 


0.0056 


1.3611 0.7347 


Duenndarm 


0.0184 


0.0331 


0.5561 1.7982 


Eierstock 


0.0150 


. 0.0104 


1.4391 0.6949 


Endokrines_Gewebe 


0.0102 


0.0000 


under* 0.0000 


Gastrointestinal 


0.0019 


0.0046 


0.4142 2.4145 


Gehirn 


0.0074 


0.0072 


1.0285 0.9723 


Haematopoetisch 


0.0067 


0.0000 


undef 0.0000 


Haut 


0.0147 


0.0000 


undef 0.0000 


Hepatisch 


0.0000 


0.0065 


0.0000 undef 


Herz 


0.0064 


0.0000 


undef 0.0000 


Hoden 


0.0345 


0.0234 


1.4759 0.6775 


Lunge 


0.0073 


0.0143 


0.5080 1.9684 


Magen-Speiseroehre 


0.0000 


0.0000 


undef undef 


• Muskel-Skelett 


0.0034 


0.0000 


■undef 0.0000 


Niere 


0.0054 


0.0000 


undef 0.0000 


Pankreas 


0.0050 


0.0055 


0.8974 1.1143 


Penis 


0.0030 


0.0000 


undef 0.0000 


Prostata 


0.0153 


0.0043 


3.5827 0.2791 


0 1 erus_JEndome t r ium 


0.0068 


0.0000 


undef 0.0000 


0 1 e rus_Myometrium 


0.0000 


0.0068 


0.0000 undef 


Uterus_allgemein 


0.0000 


0.0000 


undef undef 


Brust-Hyperpla3ie 


0.0000 







Prostata-Hyperplasie 0.0089 

Samenblase 0-0178 

Sinnesorgane 0.0000 

WeisseJBlu tkoerperchen 0.0069 

Zervix 0.0106 



FOETUS 

%Haeufigkeit. 

Entwicklung 0.0000 

Gastrointenstinal 0.0111 

Gehirn 0.0000 

Haematopoetisch 0.0039 

Haut 0.2513 

Hepatisch 0.0000 

Her z-Blutgef aesse 0 . 007 1 

Lunge 0.0036 

Nebenniere 0.0000 

Niere 0.0062 

Placenta 0.0061 

Prostata 0.024 9 

Sinnesorgane 0.0000 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0136 

Eierstock_n 0.0000 

EierstockM: 0.0152 

Endokrines_Gewebe 0.0000 

Foetal 0.0041 

Gastrointestinal 0.0122 

Haematopoetisch 0.0000 

Haut-Muskel 0.0130 

Hoden 0.0077 

Lunge 0.0000 

Nerven 0.0040 

Prostata 0.0137 

Sinnesorgane 0.0000 

Uterus n 0.0000 



35 
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Bektroniscbcr Northern fiir SEQ. ID. NO: 32 



10 



15 



20 



25 



30 



3S 



40 



45 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0039 


0.0204 


0.1907 5.2444 


Brust 


0.0141 


0.0282 


0.4991 2.0038 


Duenndarm 


0.0061 


0.0000 


undef 0.0000 


Eierstock 


0.0090 


0.0312 


0.2878 3.4745 


Endokrines_Gewebe 


0.0153 


0.0176 


0.8733 1.1451 


Gastrointestinal 


0.0192 


0.0231 


0.8283 1.2072 


Gehirn 


0.0222 


0-0113 


1.9635 0.5093 


Haema t opoe t i s ch 


0.0094 


0.0000 


undef 0.0000 


Haut 


0.0110 


0.0000 


undef 0.0000 


Hepatiscb 


0.0048 


0.0129 


0.3676 2.7200 


Herz 


0.0159 


0.0000 


undef 0.0000 


Hoden 


0.0000 


0.0234 


0.0000 undef 


Lunge 


0.0135 


0.0123 


1.1007 0.9085 


Magen-Speiseroehre 


0.0097 


0.0153 


0.6303 1.5866 


Muskel-Skelett 


0.0188 


0.0300 


0.6282 1.5918 


Niere 


0.0163 


0.0205 


0.7930 1.2610 


Pankreas 


0.0083 


0.0221 


0.3739 2.6743 


Penis 


0.0090 


0.0267 


0.3369 2.9678 


Prostata 


0.0109 


0.0043 


2.5591 0.3908 


Uterus_Endametrium 


0.0068 


0.0000 


undef 0.0000 


Uterus^Myometrium 


0.0076 


0.0204 


0.3741 2.6732 


Uterus_allgemein 


0.0102 


0.0954 


0.1067 9.3678 


Brust-Hyperplasie 


0.0032 






Prostata-Hyperplasie 


0.0119 






Samenblase 


0.0089 






Sinnesorgane 


0.0235 






Weisse_Blutkoerperchen 


0.0087 






Zervix 


0.0000 








FOETUS 








%Haeufigkeit 






Entwicklung 


0.0000 






Gastrointenstinal 


0.0111 






Gehirn 


0.0000 






Haematopoetisch 


0.0039 






Haut 


0.0000 


r 




Hepatiscb 


0.0260 






Herz-Blutgefaesse 


0.0107 






Lunge 


0.0072 






Nebenniere 


0.0000 






Niere 


0.0124 






Placenta 


0.0000 






Prostata 


0.0000 






Sinnesorgane 


0.0000! 







50 



55 



60 



65 



Brust 
Eierstock n 
Eierstock^t 
Endokrines__Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/ SO^TRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0000 

0.1595 

0.0203 ; 

0.0000 

0.0058 

0.0488 

0.0114 

0-0032 

0.0154 

0.0164 

0.0060 

0.0068 

0.0000 

0.0083 
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Elektronischer Northern Fur SEQ. K>. NO: 33 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkelt N/T 


T/N 


Blase 


0.0429 


0.2173 


0.1974 


5. 


0656 


Brust 


0.0409 


0.0357 


1.1462 


0. 


8725 


Duenndarm 


0.1870 


0.0662 


2.8269 


0. 


3537 


Eierstock 


0.0030 


0.0833 


0.0360 


27 


.7957 


Endokr ines^Gewebe 


0.0034 


0.0050 


0.6792 


1. 


4722 


Gastrointestinal 


0.2740 


0.0740 


3.7016 


0. 


2702 


Gehirn 


0.0007 


0.0216 


0.0343 


29 


.1683 


Haematopoetisch 


0.2165 


0.0000 


undef 


0. 


0000 


Haut 


0.0220 


0.0000 


undef 


0* 


0000 


Hepatisch 


0.0095 


0.3429 


0.0277 


36 


.0397 


Herz 


0.0074 


0.0275 


0.2698 


3. 


7059 


Hoden 


0.0000 


0.0585 


0.0000 


undef 


Lunge 


0.1039 


0.1738 


0.5977 


1. 


6731 


Magen-Speiseroehre 


0.0387 


0.0920 


0.4202 


2. 


3799 


' Muskel-Skelett 


0.0994 


0.0420 


2.3660 


0. 


4227 


Niere 


0.0054 


0.0753 


0.0721 


13 


.8707 


Pankreas 


0.0231 


0.1160 


0.1994 


5. 


0142 


Penis 


0.0000 


0.0000 


undef 


undef 


Prostata 


0.0262 


0.0000 


undef 


0. 


0000 


Ute rus_Endome t r ium 


0.0068 


0.0000 


undef 


0. 


0000 


Uterus_Myometrium 


0.0076 


0.0000 


undef 


0. 


0000 


Oterus_allgemein 


0-0051 


0.0000 


undef 


0. 


0000 



Brust -Hyperplasia 0.0512 
Prostata-Hyperplasie 0 . 0178 
Samenblase 0.0089 
Sinnesorgane 0 . 0118 
Weisse_Blutkoerperchen 0.0000 
Zervix 0.0319 



Entwicklung 
Gas t r ointens t inal 
Gehirn 
Haematopoe t i s ch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit- 
0.0000 
0.0000 
0.0000 
0.0079. 
0.0000 
0000 
,0000 
0000 
0000 
0000 
0000 
0.0000 
0.0000 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeuf igkeit 
Brust 0.0612 
Eierstock_n 0.0000 
Eierstock^t 0.0152 
EndokrinesjGewebe 0.0000 
Foetal 0.0012 
Gastrointestinal 0-0366 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0000 
Nerven 0.0000 
Prostata 0.0000 
Sinnesorgane 0.0000 
Oterus n 0.0000 
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DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 34 



10 



is 



20 



2S 





NORMAL 


TOMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T 


T/N 


Blase 


0.0117 


0.0588 


0, 


.1990 


5.0259 


Brust 


0.0102 


0.0094 


1. 


.0888 


0.9184 


Duenndann 


0.0153 


0.0331 


0. 


.4634 


2.1579 


Eierstock 


0.0539 


0.0130 


4. 


1445 


0.2413 


EndokrinesjGewebe 


0.0187 


0.0075 


2. 


4906 


0.4015 


Gastrointestinal 


0.0134 


0.0093 


1. 


4496 


0.6898 


Gehirn 


0.0133 


*' 0.0123 


1. 


0799 


0.9260 


Haema topoet i sch 


0.0134 


0.0000 


undef 


0.0000 


Haut 


0.0147 


0.0000 


undef 


0.0000 


Hepatisch 


0.0000 


0.0129 


0. 


0000 


undef 


Herz 


0.0170 


0.0137 


1. 


2336 


0.8107 


Hoden 


0.0173 


0.0234 


0. 


7380 


1.3551 


Lunge 


0.0062 ■ 


0.0020 


3. 


0482 


0.3281 


Magen-Speiseroehre 


0.0676 


0.0000 


undef 


0.0000 


" Muskel-Skelett 


0.0171 


0.0180 


0. 


9518 


1.0506 


Niere 


0.0109 


0.0274 


0. 


3965 


2.5219 


Pankreas 


0.0066 


0.0110 


0. 


5983 


1.6714 


Peni3 


0.0269 


0.0533 


0. 


5054 


1.9786 


Prostata 


0.0327 


0.0213 


1. 


5354 


0.6513 


U t e rus_Endome t r i urn 


0.0068 


0.0000 


undef 


0.0000 


0 terus_Myomet rium 


0.0534 


0.0000 


undef 


0.0000 


Uterus_allgemein 


0.0306 


0,0000 


undef 


0.0000 


Brust-Hyperplasie 


0.0000 











Prostata-Hyperplasie 0.0030 
Sameablase 0 . 0000 
Sinnesorgane 0 . 0118 
30 Weisse_Blutkperperchen 0.0061 
Zervix 0.0213 



FOETUS 

35 %Haeufigkeit. 

Entwicklung 0.0139 
Gastrointenstinal 0 . 0555 
Gehirn 0.0000 
Haematopoetisch 0.0079 
40 Haut 0.0000 

Hepatisch 0.0260 
Herz-Blutgefaesse 0.0071 
Lunge 0.0253 
Nebenniere 0 . 0254 
45 Niere 0.0000 

Placenta 0.0485 
Prostata 0.0000 
Sinnesorgane 0.0000 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0204 

Eierstock_n 0.0000 

EierstocJ^t 0.2430 

Endokrines_Gewebe 0 . 0245 

Foetal 0.0338 

Gastrointestinal 0.0122 

Haematopoetisch 0.0171 

Haut-Muskel 0.0680 

Hoden 0.0077 

Lunge 0.0000 

Nerven 0-0151 

Prostata 0.0342 

Sinnesorgane 0.0000 

0teru3 n 0.1166 
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Elcktronischcr Northern fur SEQ. ID, NO: 35 





NORMAL 


TOMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0273 


0.1227 


0.2225 4.4952 


Brust 


0.0243 


0.0263 


0.9236 1.0828 


Duenndarm 


0.1625 


0.0000 


undef 0.0000 


Bierstock 


0*0000 


0.0364 


0.0000 undef 


Endo kr i ne s_Ge webe 


0.0000 


0.0000 


undef undef 


Gastrointestinal 


0.2146 


0.0093 


23.1935 


Gehlra 


0.0000 


0.0062 


0.0000 undef 


Haematopoetisch 


0.1203 


0.0000 


undef 0.0000 


Haut 


0.0441 


0.0000 


undef 0.0000 


Hepatisch 


0.0000 


0.1229 


0.0000 undef 


Herz 


0.0021 


0.0000 


undef 0.0000 


Hoden 


0.0000 


0.0935 


0.0000 undef 


Lunge 


0.0353 


0.0900 


0.3926 2.5473 


Magen-Speiseroehre 


0.0000 


0.0077 


0.0000 undef 


" Muskel-Skelett 


0.0857 


0.0120 


7.1388 0.1401 


Niere 


0.0081 


0.0753 


0.1081 9.2471 


Pankreas 


0.0529 


0.1381 


0.3829 2.6116 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0109 


0.0064 


1.7060 0.5862 


Oterus_Endometrium 


0.0000 


0.0000 


undef undef 


Oterus_Myoraetrium 


0.0000 


0.0000 


undef undef 


Uterus__allgemein 


o.booo 


0.0000 


undef undef 


Bmst-Hyperplasie 


0.0160 







Prostata-Hyperplasie 0.0000 

Samenblase 0.0000 

Sinnesorgane 0.0118 

Weisse_Blutkoerperchen 0.0009 

Zervix 0.0106 



0.0431 



10 



15 



20 



25 



30 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETOS 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0039 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000.. 



35 



40 



45 



Brust 
Eierstock n 0.0000 
EierstockM: 0-0000 
Endokrines_Gewebe 
Foetal 
Gas trointest inal 
. Haematopoetisch 

Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0082 
Nerven 0.0000 
Prostata 0.0000 
Sinnesorgane 0.0000 
Uterus n 0.0000 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
0.0000 



0.0000 
0.0000 
0.0000 
0.0000 



50 



55 



60 



65 



39 



DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 36 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit 


N/T T/N 


Blase 


0.0078 


0.0332 


0.2347 4.2611 


Brust 


0,0153 


0.0169 


0.9074 1.1021 


Duenndarm 


0.0092 


0.0165 


0.5561 1.7982 


Eierstock 


0.0210 


0.0130 


1.6118 0.6204 


Endokrines_Gewebe 


0.0085 


0.0150 


0.5660 1.7667 


Gastrointestinal 


0.0153 


0.0231 


0.6627 1.5090 


Gehim 


0.0185 


0.0226 


0.8181 1.2223 


Haematopoetisch 


0.0227 


0.0379 


0.5999 1.6669 


Haut 


0.0073 


0.0000 


undef 0.0000 


Hepatisch 


0.0048 


0.0194 


0.2451 4.0800 


Herz 


0.0201 


0.0137 


1.4 649 0.6827 


Hoden 


0.0115 


0.0234 


0.4920 2.0326 


Lunge 


0.0166 


0.0164 


1.0161 0.9842 


Magen- Spe i s e roehre 


0.0000 


0.0153 


0.0000 undef 


Muskel-Skelett 


0.0137 


0.0060 


'2.2844 0.4378 


Niere 


0.0081 


0.0274 


0.2974 3.3626 


Pankreas 


0.0050 


0.0055 


0.8974 1.1143 


Penis 


0.0180 


0.0267 


0.6739 1,4839 


Prostata 


0.0065 


0.0106 


0.6142 1.6282 


Uterus_Endometrium, 


0.0135 


0.0528 


0.2561 3.9053 


0 terusJMyome t r ium 


0.0152 


0.0408 


0.3741 2.6732 


Uterus_allgemein 


0.0458 


0.0000 


undef 0.0000 


Brust-Hyperplasie 


0.0256 







10 



15 



20 



25 

Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
30 WeisseJBlutkoerperchen 
Zervix 



0208 
0178 
0118 
0061 
0106 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehim 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit. 

0.0417 

0.0167 

0.0250 

0.0079 

0.2513 

0.0000 

0.0142 

0.0181 

0.0000 

0.0000 

0.0242 

0.0249 

0.0000 



NORMIERTE / SUBTRAH IERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0136 
Eierstock_n 0.0000 
Eierstocfr_t 0.0101 
Endokrines_ Gewebe 0.0000 
Foetal 0.0082 
Gastrointestinal 0.0000 
Haematopoetisch 0.0000 
„ Haut-Muskel 0.0292 
Hoden 0.0077 
Lunge 0.0164 
Nerven 0.0110 
Prostata 0.0068 
Sinnesorgane 0.0000 
. Uterus n 0.0208 
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Elektroniscber Northern fiir SEQ. ID. NO: 37 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeiifigkeit N/T 


T/N 


Blase 


0.0078 


0.0332 


0.2347 4.2611 


Brust 


0.0026 


0.0000 


. undef 


0.0000 


Duenndarm 


0.0000 


0.0000 


undef 


undef 


Eierstock 


0.0000 


0.0000 


undef 


undef 


Endokrines_Gewebe 


0.0000 


0.0000 


undef 


undef 


Gastrointestinal 


0.0115 


0.0000 


undef 


0.0000 


Gehirn 


0.0000 


0.0000 


undef 


undef 


Haematopoetisch 


0.0147 


0.0000 


undef 


0.0000 


Haut 


0.0000 


0.0000 


undef 


undef 


Hepatisch 


0.0000 


0.0000 


undef 


undef 


Herz 


o.oooo 


0.0000 


undef 


undef 


Hoden 


0.0000 


0.0000 


undef 


undef 


Lunge 


0.0021 


0.0020 


1.0161 0.9842 


Magen-Speiseroehre 


0.0000 


0.0000 


undef 


undef 


" Muskel-Skelett 


0.0069 


0.0000 


undef 


0.0000 


Niere 


0.0000 


0.0000 


undef 


undef 


Pankreas 


0.0017 


0.0607 


0.0272 36.7712 


Peni3 


0.0000 


0.0000 


undef 


undef 


Prostata 


0.0000 


0.0000 


undef 


undef 


Uteru3_ Endometrium 


0.0000 


0.0000 


undef 


undef 


Uterus_Myometrium 


0.0000 


0.0000 


undef 


undef 


Ut erus_al lgenve in 


0.0000 


0.0000 


undef 


undef 



Brust-Hyperplasie 0.0000 

Prostata-Hyperplasie 0 . 0000 

Samenblase 0.0000 

Sinnesorgane 0.0000 

WeisseBlutkoerperchen 0 . 0009 

Zervix 0.0000 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
- ' Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeuf igkeit 

0.0000 

0.0000 

0.0000 

0.0039 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000. 



Brust 
Eierstock_n 
EierstockM: 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE / SUBTRAH IERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 
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Elektroniscber Northern fur SEQ. ED. NO: 38 



10 



is 



20 



25 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0624 


0. 


.2607 


0.2393 4. 


.1791 


Brust 


0.0051 


0. 


0282 


0.1815 5. 


.5104 


Duenndarm 


0.0337 


0. 


.0000 


undef 0 , 


,0000 


Eierstock 


0.0030 


0. 


0937 


0.0320 31.2702 


Endokrines Gewebe 


0.0034 


0. 


0025 


1.3585 0. 


.7361 


Gastrointestinal 


0.1360 


0. 


0694 


1.9604 0. 


5101 


Gehirn 


0.0015 


0. 


0318 


0.0464 21.5290 


Haematopoetisch 


0.1337 


0. 


0000 


undef 0 . 


0000 


Haut 


0.0147 


0. 


0000 


undef 0. 


0000 


Hepatisch 


0.0000 


0. 


4594 


0.0000 undef 


Herz 


0.0127 


0. 


0137 


0.9252 1. 


0809 


Hoden 


0.0000 


0. 


1754 


0.0000 undef 


Lunge 


0.0540 


0. 


2086 


0.2590 3. 


8610 


Magen-Speiseroehre 


0.0193 


0. 


0767 


0.2521 3. 


9666 


Muskel-Skelett 


0.1747 


0. 


0240 


7.2815 0. 


1373 


Niere 


0.0190 


0. 


0205 


0.9252 1. 


0808 


Pankreas 


0.0611 


0. 


0773 


0.7906 1. 


2649 


Penis 


0.0030 


0. 


0000 


undef 0. 


0000 


Prostata 


0.0196 


0. 


0021 


9.2126 0. 


1085 


Dt erus_Endame t riura 


0.0135 


0. 


0000 


undef 0. 


0000 


Oterus_Myometrium 


0.0000 


0. 


0000 


undef undef 


D t erus_al lgeme in 


0.0000 


0. 


0000 


undef undef 


Brust- Hyperplasie 


0.0064 











Prostata-Hyperplasie 0.0089 
Samenblase 0.0089 
Sinnesorgane 0.0353 
30 WeisseBlutkoerperchen 0.0009 
Zervix 0.0426 



FOETDS 

%Haeufigkeit 
Entwicklung 0.0000 
Gastrointenstinal 0. 0000 
Gehirn 0.0000 
Haematopoetisch 0.0000 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgefaesse 0.0000 
Lunge 0.0000 
Nebenniere 0.0000 
Niere 0.0000 
Placenta 0.0061 
Prostata 0.0000 
Sinnesorgane 0.0000* 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0000 
Eierstock^n 0.0000 
Eierstock^t 0.0101 
Endokrines_Gewebe 0.0000 
Foetal 0.0000 
Gastrointestinal 0.0000 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0000 
Nerven 0.0010 
Prostata 0.0000 
Sinnesorgane 0.0000 
0teru3 n 0.0000 



40 



42 



DE 198 18 619 A 1 

Elcktronischer Northern fur SEQ. ED. NO: 39 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeu£igkeit 


%Baeufigkeit N/T T/H 


Blase 


0.0702 


0.2761 


0.2542 3.9333 


Brust 


0.0614 


0.0620 


0.9899 1,0102 


Duenndarm 


0.2024 


0.1489 


1.3594 0.7356 


Eierstock 


0.0060 


0.0442 


0.1354 7.3832 


Endokrines__Gewebe 


0.0255 


0.0025 


10.1887 0 


Gastrointestinal 


0.4138 


0.0740 


5.5913 0.1788 


Gehirn 


0.0044 


0.0431 


0.1029 9.7228 


Haematopoetisch 


0.5774 


0.0000 


undef 0.0000 


Haut 


0.0257 


0.0000 


undef 0.0000 


Hepatisch 


0.0285 


0.3364 


0.0848 11.7866 


Berz 


0.0095 


0.0412 


0.2313 4.3235 


Horien 


0.0000 


0.2689 


0.0000 undef 


Lunge 


0.1735 


0.2678 


0.6476 1.5441 


Magen- Spe is e roehr e 


0.0870 


0.1917 


0.4538 2.2037 


Muskel-Skelett 


0.2193 


0.0180 


12.1B35 0, 


Niere 


0.0136 


0.1164 


0.1166 8.5746 


Pankreas 


0.0677 


0.1822 


0.3717 2.6906 


Penis 


0.0030 


0.1333 


0.0225 44.5175 


Prostata 


0.0327 


0.0085 


3.8386 0.2605 


0 t e rus_Endomet r ium 


0.0000 


0.0000 


undef undef 


Uterus_Myometriura 


0.0000 


0.0000 


undef undef 


Dt e rus^al lgexne in 


0.0051 


0.0000 


undef 0.0000 


Brust-Hyperplasie 


0.0927 







Prostata-Hyperplasie 0 . 0178 
Samenblase 0.0267 
Sinnesorgane 0.2823 
Weisse_Blutkoerperchen 0 . 0087 
Zervix 0.0426 



0981 



10 



15 



20 



25 



30 



En twickl*ung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 
%Haeufigkeit 
0.0000 
.0056 
.0000 
.0157 
.0000 
0.0000 
0.0000 
0.0000 
0.0254 
0.0000 
0.0000 
0.0000 

0.000Q: 



35 



40 



45 



NORMIERTE/SOBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 



Brust 


0. 


4082 


Eierstock n 


0. 


0000 


Eierstoc)c_ t 


0. 


3493 


Endokrines__Gewebe 


0. 


0000 


Foetal. 


0. 


0082 


Gastrointestinal 


0, 


1953 


Haematopoetisch 


0. 


0000 


Haut-Muskel 


0. 


0000 


Hoden 


0. 


0000 


Lunge 


0. 


0491 


Nerven 


0. 


0030 


Prostata 


0. 


0000 


Sinnesorgane 


0. 


0000 


Uterus n 


0. 


0000 
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5 



10 



IS 



25 





Elektronischcr 


Northern fur SEQ. 


ID 


.NO: 


40 




NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit 


N/T 


T/N 


Blase 


0.0195 


0.0690 


0. 


2825 


3.5400 


Brust 


0.0166 


0.0320 


0. 


5204 


1.9216 


Duenndarm 


0.0031 


0.0000 


undef 


0.0000 


Eierstock 


0.0150 


0.0130 


1. 


1513 


0.8686 


Endokrines_Gewebe 


0.0085 


0.0100 


0. 


8491 


1.1778 


Gastrointestinal 


0.0019 


0.0093 


0. 


2071 


4.8289 


Gehirn 


0.0067 


0.0442 


0. 


1507 


6.6362 


Haematopoetisch 


0.0187 


0.0000 


undef 


0.0000 


Haut 


0.0073 


0.0000 


undef 


0.0000 


Hepatisch 


0.0238 


0.0194 


X. 


2255 


0.8160 


Herz 


0.0625 


0.1512 


0. 


4135 


2.4182 


Hoden 


0.0345 


0.0117 


2. 


9518 


0.3388 


Lunge 


0.0322 


0.0286 


1. 


1249 


0.8889 


Magen-Speiseroehre 


0.0000 


0.0307 


0. 


0000 


undef 


Muskel-Skelett 


0.0668 


0.1260 


0. 


5303 


1.8857 


Niere 


0.0190 


0.0342 


0. 


5551 


1.8014 


Pankreas 


0.0050 


0.1160 


0. 


0427 


23.3998 


Penis 


0.0299 


0.0000 


undef 


0.0000 


Prostata 


0.0131 


0.0170 


0. 


7677 


1.3026 


UterusJEndometrium 


0.0068 


0.0528 


0. 


12B0 


7.8106 


Ut e rus_My ome t r iura 


0.0305 


0.0204 


1. 


4964 


0.6683 


Uterus^allgemein 


0.0357 


0.0000 


undef 


0.0000 


Bru3 t-Hype rpl a s i e 


0.0000 











Prostata-Hyperplasie 0 . 0208 
Samenblase 0.0356 
Sinnesorgane 0.0588 
30 Weisse Blutkoerperchen 0.0000 
Zervix 0.0319 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0000 

0.0167 

0.0000 

0.0118 

0.0000 

0.0520 

0.0107 

0.0325 

0.0000 

0.04 94 

0.0909 

0.0000 

0.0000/ 



NORM I ERTE / SUBTRAH IERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0340 
Eierstoc)f_n 0.0000 
Eierstock_t 0.0203 
55 Endokrines Gewehe 0.0490 

~Foetai 0.0297 
Gastrointestinal 0.0000 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
60 Hoden 0.0154 

Lunge 0.0082 
Nerven 0.0030 
Prostata 0.0000 
Sinnesorgane 0.0000 
65 Uterus n 0.0000 
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Hektionischer Northern fur SEQ. ID. NO: 41 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T T/N 


Blase 


0.0390 


0.1355 


0.2878 3.4744 


Brust 


0.0269 


0.0564 


0. 4764 2. 0992 


Duenndarm 


0.1932 


0.0000 


undef 0.0000 


Eierstock 


0.0000 


0.0312 


0. 0000 undef 


Endokrines_Gewebe 


0.0017 


0.0025 


0. 6792 1. 4722 


Gastrointestinal 


0.2203 


0. 0509 


4 . OJUU^U. ZoUU 


Gehirn 


0.0000 


0.0103 


0.0000 undef 


Haema topoe t is ch 


0.0882 


0.0000 


undef 0.0000 


Haut 


0.0184 


0.0000 


undef 0.0000 


Hepatisch 


0.0000 


0.0518 


0.0000 undef 


Herz 


0.0095 


0.0137 


0.6939 1.4412 


Hoden 


0.0000 


0.0702 


0.0000 undef 


Lunge 


0.0478 


0.1472 


0.3246 3.0809 


Magen-Speiseroehre 


0.0193 


0.0153 


1.2605 0.7933 


Muskel-Skelett 


0.1011 


0.0120 


8. 4237 0. 1187 


Niere 


0.0054 


0.0548 


0. 0991 10. 0878 


Pankreas 


0,0099 


0.0552 


0. 1795 5.5714 


Penis 


0.0000 


0.2933 


0. 0000 undef 


Prostata 


0.0153 


0.0021 


7.1654 0. 1396 


Oterus_ Endometrium 


0.0068 


0.0000 


undef 0.0000 


Uterus_Myometrium 


0.0000 


0.0068 


0.0000 undef 


Oterus_allgemein 


0.0000 


0.0000 


undef undef 


Brust-Hyperplasie 


0.0192 






Prostata-Hyperplasie 


0.0119 






Samenblase 


0.0000 






Sinnesorgane 


0.0000 






Weisse Blutkoerperchen 


0.0009 






Zervix 


0.0213 








FOETUS 








%Haeu£igkeit 






Entwicklung 


0.0000 






Gastrointenstinal' 0.0000 






Gehirn 


0.0000 






Haema topoetisch 


0.0079 






Haut 


0.0000 






Hepatisch 


0.0000 






Herz-Blutgefaesse 


0.0000 






Lunge 


0,0000 






Nebenniere 


0.0000 






Niere 


0.0000 






Placenta 


0.0000 






Prostata 


0.0000 






Sinnesorgane 


0.0000: 







NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufig)ceit 

Brust 0.0000 

Eierstockjn 0.0000 

Eierstock^t 0.0000 

Endokrines_Gewebe 0.0000 

Foetal 0.0000 

Gastrointestinal 0.0000 

Haematopoetisch 0.0000 

Haut-Muskel 0.0000 

Hoden 0.0000 

Lunge 0.0000 

Nerven 0.0000 

Prostata 0.0000 

Sinnesorgane 0.0000 

. Uterus n 0.0000 
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Elektronischer Northern fur SEQ. ID. NO: 42 



S 



10 



20 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T T/N 


Blase 


0.0273 


0.0895 


0.3051 3.2777 


Brust 


0.0665 


0.0808 


. 0.8230 1.2151 


Duenndarm 


0.0429 


0;0000 


undef 0.0000 


Eierstock 


0.0539 


0.0234 


2.3025 0.4343 


EndokrinesjGewebe 


0.0630 


0.0978 


0.6444 1.5518 


Gastrointestinal 


0.0441 


0.0324 


1. 3608 0.7348 


Gehirn 


0.0554 


0.1006 


0.5510 1.8149 


Haematopoetisch 


0.0454 


0.0379 


1. 1998 0.8335 


Haut 


0.0257 


0.0000 


undef 0.0000 


Hepatisch 


0.0381 


0.0453 


0.8403 1.1900 


Herz 


0.0435 


0.0825 


0.5268 1.8981 


Hod en 


0.0575 


0.0000 


under 0.0000 


Lunge 


0. 1008 


0.0552 


1 . oZoZ 0.54/9 


Magen-Speiseroehre 


0. 0580 


0.Q997 


rt CD1Q 1 71 oo 

U, oolo 1 . /loo 


* Muskel-Skelett 


n nan c 
U . US / b 


u. UOOU 


J. • 4 / 7 I U. U / JO 


Niere 


0.0516 


0.0890 


0.5795 1.7255 


Pankreas 


0.0248 


0.0773 


0.3205 3.1200 


Penis 


0.0599 


0.1066 


0.5616 1.7807 


Prostata 


0.0567 


0.0766 


0.7393 1.3527 


U t erus_Endome t r ium 


0.0405 


0.1055 


0.3841 2.6035 


Uterus_Myometrium 


0.0534 


0.0475 


1.1223 0.8911 


Uterus_allgemein 


0.0866 


0.0000 


undef 0.0000 


Brust-Hyperplasie 


0.0416 







Prostata-Hyperplasie 0 . 0654 

Sainenblase 0.0712 

Sinnesorgane 0.0823 

Wei3seJBlutkoerperchen 0 . 1110 

Zervix 0.0319 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Qehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0139 

0.0361 

0.0125 

0.0433 

0.0000 

0.0000 

0.0071 

0.0253 

0.0254 

0.0432 

0.0364 

0.0499 

0.0126 



50 



55 



60 



65 



Brust 
Eierstock_n 
EierstockM: 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.1020 

0.1595 

0.0709 

0.0000 

0.0635 

0.0000 

0.0114 

0.0680 

0.04 63 

0.0328 

0.0351 

0.0342 

0.0464 

0.0083 
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Elektronischer Northern fur SEQ. ED. NO: 43 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T 


T/N 


Blase 


0.0195 


0.0639 


0.3051 


3. 


2777 


Brust 


0.0345 


0. 0470 


0,7350 


1 . 


3606 


Duenndarm 


0.0399 


0.1985 


0.2008 


4. 


9797 


EierstocJc 


U. 0150 


0.0676 


0.2214 


4. 


5168 


Endokr ine s_Gewebe 


0.0238 


0.0878 


0.2717 


3. 


6805 


Gastrointestinal 


0.0862 


0.1064 


0.8103 


1. 


2341 


Gehirn 


0.0067 


0.0277 


0.2400 


4. 


1669 


Haematopoetisch 


0.0080 


0.0000 


undef 


0. 


0000 


Haut 


0.0587 


0.0000 


undef 


0. 


0000 


Hepatisch 


0.0190 


0.0323 


0.5882 


1. 


7000 


flerz 


0.0011 


0.0962 


0.0110 


90 


t.7941 


Hoden 


0.0000 


0.0234 


0.0000 


undef 


Lunge 


0.0062 


0.0164 


0.3810 


2. 


6245 


Magen-Speiseroehre 


0.0387 


0.3450 


0.1120 


8. 


9248 


• Muskel-Skelett 


0.0000 


a men 


U. UUUU 


undef 


Niere 


0. 0760 


0.1643 


0.4626 


2. 


1617 


Pankreas 


U.UD/ / 


0.0276 


2.4530 


0. 


4077 


Penis 


0.0090 


0.0533 


0.1685 


5. 


9357 


Prostata 


0.0109 


0.0255 


0.4265 


2. 


3446 


Dterus — Endometrium 


0.0270 


0.0000 


undef 


0. 


0000 


OterusJMyometrium 


0.0076 


0.0272 


0.2806 


3. 


5642 


Dterus_allgemein 


0.0000 


0.4771 


0.0000 


undef 


Brust-Hyperplasie 


0.0576 











Prostata-Hyperplasie 0.0119 

Samenblase 0 . 1068 

Sinnesorgane 0.0235 

Weisse_Blutkoerperchen 0 . 0061 

Zervix 0.0319 



FOETUS 

%Haeufigkeit. 

Entwicklung 0.0000 

Gastrointenstinal 0.0111 . 

Gehirn 0.0813 

Haematopoetisch 0 . 0000 

Haut 0.0000 

Hepatisch 0.0000 

Herz-Blutgefaesse 0.0000 

Lunge 0.0145 

Nebenniere 0.0000 

Niere 0.0309 

Placenta 0.0121 

Prostata 0.0249 

Sinnesorgane 0.0000 



NORMIERTE / SDBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0136 

Eierstock_n 0.0000 

Eierstock^t 0.0101 

Endokrines_Gewebe 0.0000 

FoetaX 0.0122 

Gastrointestinal 0.4149 

Haematopoetisch 0.0000 

Haut-Muskel 0.0000 

Hoden 0.0154 

Lunge 0.0573 

Nerven 0.0040 

Prostata 0.0068 

Sinnesorgane 0.0000 

Uterus n 0.0000 
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Elcktronischer Northern fur SEQ. ID. NO: 44 



10 



15 



20 



25 



30 



Blase 
Brust 
Duenndarm 
Eier stock 
Endokrines_Gewebe 
Gastrointestinal 
Gehira 
Haema topoe tisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
D te rusjMyome t r ium 
Ot e rus_a 1 lgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weiss e_Blutkoerper chen 
Zervix 



NORMAL 

%Haeufigkeit 

0.0156 

0.0166 

0.0828 

0.0030 

0.0017 

0.1226 

0.0000 

0.0481 

0.0294 . 

0.0000 

0.0032 

0-0000 

0.0447 

0.0097 

0.0497 

0.0054 

0.0215 

0.0030 

0.0044 

0.0000 

0.0000 

0.0000 

0.0352 

0.0030 

0.0000 

0.0000 

0.0000 

0.0106 



TUMOR 

%Haeu£igkeit 

0.0511 

0.0320 

0.0000 

0.0416 

0.0050 

0.0278 

0.0123 

0.0000 

0.0000 

0,1294 

0-0137 

0.1403 

0.1390 

0.0000 

0.0060 

0.0685 

0-0166 

0.0000 

0.0043 

0.0000 

0.0000 

0.0000 



Verhaeltaisse 
N/T T/N 
0.3051 3.2777 
0.5204 1.9216 
undef 0.0000 
0.0720 13.8979 
0.33962.9444 
4.4178 0.2264 
0.0000 undef 
undef 0.0000 
undef 0.0000 
0.0000 undef 
0.2313 4.3235 
0.0000 undef 
0.3213 3.1128 
undef 0.0000 
6.2810 0-1208 
0.0793 12-6097 
1.2963 0.7714 
undef 0.0000 
1.0236 0.9769 
undef undef 
undef undef 
undef undef 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haema topoet i sen 
Haut 
Hepa tisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit. 

0.0000 

0.0000 

0.0000 

0.0079 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 



50 



55 



60 



65 



Brust 
Eierstockja 
Eierstock^_t 
Endokrines_ Gewebe 
Foetal 
Gastrointestinal 
Haema topoe tisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/SDBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0-0136 

0.0000 

0.0000 

0.0000 

0-0000 

0.0000 

0-0000 

0.0000 

0.0000 

0.0000 

0-0000 

0.0000 

0.0000 

0.0000 
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Elcktrooischer Northern fur SEQ. ID. NO: 45 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0117 


0.0383 


0. 


3051 3. 


2777 


Brust 


0.0179 


0.0207 


0. 


8661 1. 


1546 


Duenndarm 


0.0245 


0.0165 


1. 


4830 0. 


6743 


Eierstock 


0.0329 


0.0156 


2. 


1106 0. 


4738 


Endokrines_Gewebe 


0.0221 


0.0326 


0. 


6792 1. 


4722 


Gastrointestinal 


0.01S3 


0.0139 


1. 


1045 0. 


9054 


Gehirn 


0.0296 


0.0288 


1. 


0285 0. 


9723 


Haematopoetisch 


0.0187 


0.0379 


0. 


4940 2. 


0241 


Haut 


0.0257 


0.0000 


undef 0. 


0000 


Hepatisch 


0.0048 


0.0129 


0. 


3676 2. 


7200 


Herz 


0.0201 


0.0550 


0. 


3662 2. 


7306 


floden 


0.0058 


0.0117 


0. 


4920 2. 


0326 


Lunge 


0.0218 


0.0225 


0. 


9699 1. 


0311 


Magen-rSpeiseroehre 


0.0387 


0.0383 


.1. 


0084 0. 


9916 


■ Muskel-Skelett 


0.0171 


0.0240 


0. 


7139 1. 


4008 


Niere 


0.0190 


0.0068 


2. 


7756 0. 


3603 


Pankreas 


0.0116 


0.0221 


0. 


5235 1. 


9102 


Penis 


O.0090 


0.0000 


undef 0. 


0000 


Prostata 


0.0327 


0.0319 


1. 


0236 0. 


9769 


U te rus^Endome t r ium 


0.0068 


0.0000 


undef 0 . 


0000 


D t e rus_Myome t r ium 


0.0000 


0.0136 


0. 


0000 undef 


Uterus allgemein 


0.0560 


0.0000 


undef 0 . 


0000 


Brust-Hyperplasie 


0.0032 











Prostata-Hyperplasia 0 - 0208 

Samenblase 0.0178 

Slnnesorgane 0 . 0118 

Weisse_J$lutkoerperchen 0 . 0260 

Zervix 0.0000 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haeraatopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETDS 

%Haeufigkeit'. 

0.0000 

0.0139 

0.0313 

0.0039 

0.0000 

0.0000 

0.0071 

0.0217 

0.0000 

0.0000 

0.0424 

0.0249 

0.0000 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN - 
%Haeuf igkeit 

Brust 0.0408 

Eierstock^n 0.0000 

Eierstock^t 0.0405 

Endokrines_Gewebe 0.0245 

Foetal 0.0087 

Gastrointestinal 0.0000 

Haematopoetisch 0.0057 

Haut-Muskel 0;0097 

Hoden 0.0231 

Lunge 0.0082 

Nerven 0.0251 

Prostata 0.0205 

Sinnesorgane 0.0000 

Uterus n 0.0125 
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Ekktronischer Northern fiir SEQ. ID. NO: 46 



10 



15 



20 



25 



30 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit 


N/T 


T/N 


Blase 


0. 


1170 


0.3067 


0. 3814 


2. 


6222 


Brust 


0. 


3019 


0.2387 


1.2646 


0. 


7908 


Duenndarm 


1. 


1559 


0.0000 


undef 


0. 


0000 


Eierstock 


0. 


0120 


0.0676 


0.1771 


5. 


6460 


Endokrines_Gewebe 


0. 


0034 


0.0125 


0.2717 


3. 


6805 


Gastrointestinal 


1. 


2798 


0.1804 


7.0940 


0. 


1410 


Gehirn 


0, 


0007 


0.0380 


0*0195 


51 


.3918 


Haematopoetisch 


0. 


4785 


0.0000 


undef 


0. 


0000 


Haut 


0. 


1322 


0.0000 


undef 


0. 


0000 


Hepatisch 


0. 


0285 


0.4594 


0.0621 


16 


1.0932 


Herz 


0. 


0138 


0.0275 


0.5011 


1. 


9955 


Hoden 


0. 


0000 


0.2456 


0.0000 


undef 


Lunge 


0. 


3625 


0.3435 


1.0554 


0. 


9475 


Magen-Speiseroehre 


0. 


0483 


0.1533 


0.3151 


3. 


1733 


' Muskel-Skelett 


0. 


2124 


0.0480 


'4.4260 


0. 


2259 


Niere 


0. 


0163 


0.1917 


0 . 0850 


11 


.7691 


Pankreas 


0. 


1074 


0.4528 


0.2371 


4. 


2171 


Penis 


0. 


0000 


0.0533 


0.0000 


undef 


Prostata 


0. 


0806 


0.0255 


3.1562 


0. 


3168 


Dt e rus jEndome t r ium 


0. 


0068 


0.0000 


undef 


0. 


0000 


Uteru3_Myometrium 


0. 


0152 


0.0000 


undef 


0. 


0000 


Uterus_allgemein 


0. 


0102 


o.oooo 


undef 


0. 


0000 


Brust-Hyperplasie 


0. 


3805 











Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weis se_Blutkoerperchen 
Zervix 



0.0505 
0.0356 
0.5175 
0.0078 
0.0213 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeuf igkeit. 
0.0000 
0.0000 
0.0000 
0,0039 
0.0000 
0.0000 
0.0000 
0000 
0000 
0000 
0303 
0.0000 
0.0000 



50 



55 



60 



65 



Brust 
Eierstock_ n 
Eierstock«_t 
Endokrines_Gewebe 
"Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskcl 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 

IHaeufigkeit 

0.7687 

0.0000 

0.1924 

0.0000 

0.0029 

0.0976 

0.0000 

0.0000 

0.0000 

0.0328 

0.0020 

0.0000 

0.0000 

0.0000 
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Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen- Spei s e r oehre 
' Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
D t e rus_Endome t r ium 
U te rus_Myome t r ium 
Uterus_allgemein 
Brus t-Hyperplas ie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blu£koerperchen 
Zervix 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



Brust 
Eierstock_n 
Eierstoc)e[_t 
Endok r ines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



Elektronischer Northern fiir SEQ. ID. NO: 47 



NORMAL 


TUMOR' 


Verhaeltnisse 


%Haeufigkeit 


%Haeufigkeit N/T T/N 


0.0195 


0.0486 


0.4014 2.4911 


0,0550 


0.1297 


. 0.4241 2.3580 


0.0123 


0.1654 


0-0741 13.4866 


0.0449 


0.0728 


0.6167 1.6214 


0.0119 


0.0075 


1.5849 0.6309 


0.0153 


0.1018 


0.1506 6.6398 


0.0037 


0.0359 


0.1029 9.7228 


0.0134 


0.0000 


undef 0.0000 


0.0404 


0.0847 


0.4765 2.0985 


0.0285 


0.0323 


0.8824 1.1333 


0.0233 


0.0825 


0.2827 3.5374 


0.0460 


0.0351 


1.3119 0.7622 


0.0551 


0.0491 


1.1219 0.8913 


0.0000 


0.0613 


0.0000 undef 


0.1148 


0.3120 


0.3679 2.7180 


0.0271 


0.0137 


1.9826 0.5044 


0.0116 


0.0884 


0.1309 7.6408 


0.0180 


0.0533 


0.3369 2.9678 


0.0087 


0.0319 


0.2730 3.6634 


0.0270 


0.0000 


undef 0.0000 


0.0610 


0.1902 


0.3206 3.1187 


0.0509 


0.0000 


undef 0.0000 



0.0288 
0.0069 
0.0178 
0.0118 
0.0000 
0.0532 



F0ET0S 

%Haeuf igkeit 

0.1809 

0.1388 

0.0063 

0.0551 

0.2513 

0.0000 

0.0605 

0.1409 

0.2535 

0.1235 

0.0364 

0.0748 

1.3934, 



NORMIERTE/SOBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0204 

0.0000 

0.0304 

0.0000 

0.0315 

0.0122 . 

0.0000 

0.0518 

0.0231 

0.0000 

0.0040 

0.0137 

0.0000 

0.0083 
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Elcklronischer Northern fur SEQ. ID. NO: 48 



to 



is 



20 



25 



30 





NORMAL 


T0MOR 


Verhaeltnisse 




IHaeufigkeit 


%Haeufigkeit N/T 


T/N 


Blase 


0.0624 


0.1380 


0. 


4520 


2.2125 


Brust 


0.0345 


0.0564 


0. 


6125 


1.6327 


Duenndarxn 


0.049i 


0.0165 


2. 


9659 


0.3372 


Eierstock 


0.0689 


0.0520 


1. 


3239 


0.7553 


EndokrinesjGewebe 


0.0392 


0.0276 


1. 


4202 


0.7041 


Gastrointestinal 


0.0460 


0.0648 


0. 


7100 


1.4084 


Gehirn 


0.0333 


0.0678 


0. 


4909 


2.0372 


Haema t opoe t is ch 


0.0374 


0.0758 


0. 


4940 


2.0241 


Haut 


0.0257 


0.1695 


0. 


1516 


6.5954 


Hepatisch 


0.1142 


0.0518 


2. 


2059 


0.4533 


Herz 


0.0774 


0.7010 


0. 


1104 


9.0616 


Hoden 


0.2589 


0.1520 


1. 


7030 


0.5872 


Lunge 


0.0540 


0.0491 


1. 


1007 


0.9085 


Magen-Speiseroehre 


0.1256 


0.2070 


0. 


6069 


1.6477 


' Muskel-Skelett 


0.1542 


0,2100 


b. 


7343 


1.3619 


Niere 


0.0109 


0.0959 


0. 


1133 


8.8268 


Pankxeas 


0.0198 


0.1270 


0. 


1561 


6.4071 


Peni3 


0.0359 


0.0800 


0. 


4493 


2.2259 


Prostata 


0.0785 


0.0511 


l. 


5354 


0.6513 


D te rus^Endome t rium 


0.0338 


0.0528 


0. 


6402 


1.5621 


Uterus_Myometrium 


0.0457 


0.0679 


0. 


6734 


1.4851 


Uterus_allgemein 


0.0764 


0.0954 


0. 


8006 


1.2490 


Brust-Hyperplasie 


0,0224 










Prostata-Hyperplasie 


0.0624 










Samenblase 


0.0445 










Sinnesorgane 


0.0118 










Weiss e_Blutkoerperchen 


0.0529 










Zervix 


0.0532 











FOETDS 

35 %Haeufigkeit. 

Entwicklung 0.0139 

Gastrointenstinal 0.0167 

Gehirn 0.0000 

Haema topoetisch 0.0039 

40 Haut 0-0000 

Hepatisch 0.0000 

Herz-Blutgefaesse 0.0249 

Lunge 0.0108 

Nebenniere 0.0000 

45 - Niere 0.0000 

Placenta 0.3333 

Prostata 0.1995 

Sinnesorgane 0.0000. 



50 

NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.1156 
Eierstockji 0.0000 
55 EierstocKM: 0.1873 

Endokrines_Gewebe 0. 0000 
Foetal 0.0181 
Gastrointestinal 0.0000 
Haema topoetisch 0.0057 
Haut-Muskel 0.0486 
Hoden 0.0000 
Lunge 0.0328 
Nerven 0.0020 
Prostata 0.0274 
Sinnesorgane 0.0155 
65 Uterus n 0.0541 
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Elcklronischer Northern fur SEQ. ID. NO: 49 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T 


T/N 


Blase 


0.0585 


0.127B 


0.4576 


2. 


1852 


Brust 


0.0742 


0-0507 


. 1.4619 


0. 


6841 


Duenndann 


0-1134 


0.0000 


undef 


0. 


0000 


Eierstock 


0.0659 


0.0390 


1,6885 


0. 


5922 


Endokrines_Gewebe 


0.0324 


0.0176 


1.8437 


0. 


5424 


Gastrointestinal 


0.0690 


0.0694 


0.9940 


1. 


0060 


Gehirn 


0.0325 


0.0893 


0.3641 


2. 


7464 


Haematopoetisch 


0.0842 


0.0000 


undef 


0. 


0000 


Haut 


0.0404 


0.0000 


undef 


0. 


0000 


flepatisch 


0.1855 


0.0647 


2. 8677 


0. 


3487 


. Herz 


0.0435 


0.0962 


0.4516 


2. 


2145 


Hoden 


0.0230 


0.0585 


0.3936 


2. 


5408 


Lunge 


0-2275 


0.1063 


2.1396 


0. 


4674 


Ma gen-Spe i s e r oehre 


0.0483 


0.1993 


0.2424 


4. 


1252 


Muskel-Skelett 


0 . 0857 


ft mirt 
U. U /ZU 


'l 1 DQD 


u . 


o405 


Niere 


0.0706 


0.0274 


2. 57*7 4 


0. 


3880 


Pankreas 


0.0347 


0.2927 


0-1185 


8. 


4367 


Penis 


0.0090 


0.0267 


0.3369 


2. 


9678 


Prostata 


0.0458 


0.0319 


1.4331 


0. 


6978 


Uterus^Endometrium 


0.0270 


0.0000 


undef 


0. 


0000 


Uterus_Myometrium 


0.0610 


0.0204 


2.9927 


0. 


3341 


Uterus_allgemein 


0.0917 


0.0000 


undef 


0- 


0000 


B rus t-Hype rpl as ie 


0-0416 











Prostata-Hyperplasie 0.0386 

Samenblase 0.0267 

Sinnesorgane 0.1647 

WeisseJBlutkoerperchen 0-2497 

Zervix 0.0000 



FOETUS 

%Haeufigkeit 

Entwicklung 0.0000 

Gastrointenstinal 0.0916 

Gehirn 0.0500 

Haeraatopoetisch 0 . 08 65 

Haut 0.0000 

Hepatisch 0.0520 

Herz-Blutgefaesse 0.0178 

Lunge 0.0434 

Nebenniere 0.0000 

Niere 0.0432 

Placenta 0-1636 

Prostata 0.0000 

Sinnesorgane 0-0000. 



NORMIERTE/SUBTRABIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.1156 

Eierstock n 0.0000 

Eierstock^t 0.0354 

Endokrines_Gewebe 0.0490 

Foetal 0.0571 

Gastrointestinal 0 . 0854 

Haematopoetisch 0.0057 

Haut-Muskel 0.0097 

Hoden 0.0154 

Lunge 0.0491 

Nerven 0.0562 

Prostata 0.0615 

Sinnesorgane 0.0000 

Uterus n 0.1957 
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Bektronischcr Northern fur SEQ. ID. NO: 50 



10 



15 



20 





NORMAL 


TUMOR 


Verhaeltnisse 




%Baeufigkeit %flaeufigkeit N/T 


T/N 


Blase 


0.0546 


0.1099 


0.4967 


2.0135 


Brust 


0.0563 


0.0489 


1.1517 


0.8683 


Duenndarm 


0.1380 


0.0331 


4.1708 


0.2398 


Eierstock 


0.0599 


0.0650 


0.9210 


1.0858 


Endo kr i nes__Ge webo 


0.0238 


0.0351 


0.6792 


1. 4722 


Gastrointestinal 


0.1322 


■ '0.1804 


0.7328 


1. 3647 


Gehirn 


0.0229 


0.0452 


0.5072 


1. 9714 


Haematopoetisch 


0.0241 


0.0000 


undef 


0.0000 


Haut 


0.1689 


0.0000 


undef 


0.0000 


Hepatisch 


0.0285 


0.0518 


0.5515 


1.8133 


Herz 


0.1166 


0.1649 


0.7067 


1.4150 


Hoden 


0.0115 


0.0117 


0.9839 


1.0163 


Lunge 


0.1070 


0.1329 


0.8050 


1.2422 


Magen-Speiseroehre 


0.1450 


0.0613 


•2.3635 


0.4231 


Muskel-Skelett 


0.0685 


0.0240 


2 . 8555 


0.3502 


Niere 


0.0570 


0.0753 


0.7570 


1.3210 


Pankreas 


0,0165 


0.1491 


0.1108 


9.0256 


Penis 


0.0779 


0.0267 


2.9202 


0.3424 


Prostata 


0.0610 


0.0255 


2.3885 


0.4187 


Uterus_Endometrium 


0.0338 


0.0000 


undef 


0.0000 


Oterus_Myametrium 


0.0991 


0.0340 


2.9179 


0.3427 


Uterus_allgemein 


0.0509 


0.1908 


0.2669 


3.7471 


Brust-Hyperplasie 


0.0064 









25 

Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
30 Weisse_Blutkoerperchen 
Zervix 



0.0386 
0.0801 
0.0588 
0.0616 
0.1810 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeuf igkeit 

0.0139 

0.0194 

0.0000 

0.0275 

0.0000 

0.0000 

0.0142 

0.0145 

0.0000 

0.0247 

0.0364 

0.0499 

0.0000 



50 

NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0068 

Eierstock^n 0.0000 

55 Eierstock_t 0.0101 

Endokrines_Gewebe 0-0000 

Foetal 0.0064 

Gastrointestinal 0.0976 

Jlaematopoetisch 0.0057 

~" Haut-Muskel 0.0259 

60 Hoden 0.0309 

Lunge 0.1802 

Nerven 0.0050 

Prostata 0.0274 

Sinnesorgane 0.0000 

6S Oterus n 0.0125 
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2.2 Fisher-Test 

Um zu enlschcidcn, ob eine Partial-Sequenz S eincs Gens in einer Bibliothek fiir Normal-Gewcbe significant hiiufiger 
oder seltener vorkommt als in einer Bibliothek fur entartetes Gewebe, wird Fishers Exakter Test, ein statistisches Slan- 
dardverfahren (Hays, W. L., (1991) Statistics, Harcourt Brace College Publishers, Fort Worth), durchgefuhrt 

Die Null-Hypothese lautet: die beiden Bibliotheken konnen beziigiich der Haufigkeit zu S homologer Sequenzen nicht 
untcrschieden werden. Falls die Null-Hypothcse mit hinreichend hoher Sicherheit abgelehnt werden kann, wird das zu S 
gehorcndc Gen als interessanter Kandidat fiir ein Krcbs-Gen akzcptiert, und cs wird im nacbsten Schritt versucht, eine 
Verlangerung seiner Sequenz zu crreichen. 

Beispiel 3 

Automatische Verlangerung der Parti al-Sequcnz 
Die automatische Verlangerung der Partial-Scquenz S vollzieht sich in drei Schritten: 

1. Ermittlung aller zu S homologen Sequenzen aus der Gcsamtmenge der zur Verfugung stehenden Sequenzen mit 
Hilfe von BLAST 

2. Assemblierung dieser Sequenzen mittcls des Standardprogramms GAP4 (Bonfield, J. K„ Smith, K. E, und Sta- 
den R. (1995), Nucleic Acids Research 23 4992-4999) (Contig-Bildung). 

3. Berechnung einer Konsens-Sequenz C aus den assemblicrten Sequenzen. 

Die Konsens-Sequenz C wird im allgemeincn langer sein als die Ausgangssequenz s. Ihr clektronischcr Northern-Blot 
wird dcmzufolge von dem Vui S abweichen. Ein emcuter Fisher-Test entscheidet, ob die Alternativ-Hypothese der Ab- 
weichung von einer gleicbmaBigen IZxpression in beiden Bibliotheken aufrechterhalten werden kann. 1st dies der Fall, 
wird versucht, C in gleichcr Wcise wie S zu verlangcrn. Diesc Iteration wird mit der jeweils crhaltencn Konscnsus-Sc- 
quenzen Q (i: Index der Iteration) fortgesetzt, bis die Alternativ-Hypothese verworfen wird (if Hq Exit; Abbruchkrite- 
rium I) oder bis keine automatische Verlangerung mehr mSglich ist (while Q > Cn ; Abbruchkriterium II). 

Im Fall des Abbruchkriteriums II bekommt man mit der nach der letzten Iteration vorliegenden Konsens-Sequenz eine 
komplette oder annahemd komplette Sequenz eines Gens, das mit hoher statistischer Sicherhcit mit Krebs in Zusammen- 
hang gebracht werden kann. 

Analog der oben beschricbenen Beispiele konnten die in der "ttibclle I beschriebenen Nukleinsaure-Sequenzen aus 
Blasentumorgewcbe gefunden werden. 

Ferner konnten zu den einzelnen Nukleinsaure-Sequenzen die Peptidsequenzen (ORFs) bestimmt werden, die in der 
Tabelle II aufgelistet sind, wobei wenigen Nukleinsaure-Sequenzen kein Pcptid zugeordnet werden kann und einigeo 
Nukleinsaure-Sequenzen mehr als ein Peptid zugeordnet werden kann. Wic bereits oben crwahnt, sind sowohl die crmit- 
tclten Nukleinsaure-Sequenzen, als auch die den Nukleinsaure-Sequenzen zugeordneten Peptid-Sequenzen Gegenstand 
der vorliegenden Erfindung. 



55 



DE 198 18 619 A 1 



Chromosomale 
Lokalisatlon 




| unbekannt' 


| unbekannt) 


I unbekanntl 


| unbekanntl 




unbekannt 


| unbekannt] 


I unbekanntl 


| unbekanntl 


I unbekanntl 


| unbekanntl 


| Chromosom 7| 






| unbekanntl 




c 

CO 

JA 
CO 

JQ . 

c 




I unbekannt) 






t unbekanntl 






unbekanntl' 


unbekannt); 








Lange der 
angemeldeten 
Sequenz In 
Basen 


CM 
O 
CM 


CD 


CM 
CD 
I s * 


CO 

v- 


| 1148| 


2407 


1471 


| 1732| 


o> 

CO 

oo 


o 
m 


| . 1467| 


m 

CD 
CO 


CD 


to 


CO 
CD 
CO 


CO 

m 
I s * 


3 

CO 


CO 


o 

CD 
CM 


2565 1 


T— 

CO 


2096 1 


13481 


CO 

in 

CO 


o> 

CO 


1632| 


29721 


CD 
CD 


o> 

CO 


Rf 


Lange des 
Ausgangs-EST In 
Basen 


CM 


CO 
CM 


CD 
CO 


in 

3 


CO 

o 

CO 


CO 


9 


CD 
CO 
x— 


o> 


o 
in 


CO 
CO 
CM 




co 


CO 
CO 


CD 
CD 


CO 


CO 
CM 


o 

CO 


o 

CO 


T— 
I s - 

CM 


CM 


CO 
CM 


CO 

s 


d> 
CM 


o> 
co 


CD 


tn 

CO 

T- 


CM 


CM 
CM 


CO 
CM 


Funktion 


Identlsch zum humanen IgQ alis V-D-J6 Region 


unbekannt) 


unbekanntl 


unbekanntl 


| unbekanntl 


Identlsch zum Kaposi Sarcoma-assozJIerten Herpesvirus 

Glycoprotein M 


Hyaluronectln (HN) Ist eln Glykoproteln, dass Hyaluron blndet 
und oft In humanen Tumoren gefunden wlrd 


| unbekannt! 


| unbekannt' 


| unbekannt 


| unbekannt 


| unbekannt 


I mitochondriales Enzym 1 


| Identlsch zum humanen Collagen t (alpha-1 Kette) | 


| identlsch zum huamen Keratin K7 (Typ II) 


| unbekanntl 


| Identlsch zum humanen Cofilin | 


das H19 Gen wird nur vom maternalen Chromosom exprlmiert 
und steilt mogllcherweise ein Tumor-Supressorgen dar 


Identlsch zum humanen IGF-2 Wachstumsfaktor| 


| - unbekanntl 


Identlsch zum humanen Ig V-L (Klon VL 29-1 ) | 


Identlsch zum humanen H19| 


| mitochondriales Enzym | 


| Identlsch zum humanen Anti-Hepatitis A IgG Variable Reglon| 


| Identlsch zum humanen Saposln | 


| Gen, dass durch IL6 induziert wird| 


| unbekanntl 


Identlsch zu Immunoglobulin M schwere Kette V Reglonl 


I Identlsch zur humanen Ig schweren Kette (variable RegIon)| 


Identlsch zur cDNA, die die leichte Kette eines monoklonalen 
Antlkdrpers kodiert, der gegen das humane Cytomegalovirus 

65 kD Protein gerichtet ist 


Wahrschelnllchkeit fur 
eine spezlflsche 
Expression Im 
Tumorgewebe % 


i 99.54| 


I 99.51 1 


| 99.2 1 


| 98.58| 


I 97.431 


CO 

o> 


' 95.29 


| 95.29| 


I 95.291 


CO 
CM 


| 91.23| 


I 91.231 


I 91.231 


| 91.23| 


1 1oo| 


o 
o 


I 99.97] 


o 
o 


O 

o 


| 99.56 1 


I 99.26 1 


§ 


| 98.77| 


o 
o 


CD 

CD 
CO 


1 94.051 


1 94.051 


o 
o 

v- 


o 
o 

T — 


o 
o 

x— 


Sequenz 
ID No.: 




CM 


CO 




in 


co 




CO 


o> 


o 




CM 


CO 




tn 


CD 




CO 


CD 


o 

CM 


T- 

CM 


CM 
CM 


CO 
CM 


3 


in 

CM 


CO 
CM 


N- 

CM 


00 
CM 


CD 
CM 


o 

CO 
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Chromosomale 
LokalisaLlon 


| unbekannt! 


| unbekanntl 








| unbekanntl 


















| unbekanntl 








■* 




| Lange der 
[ angemeldeten 
Sequenz In 
Basen 


| 1031 | 


CO 
CO 


v- 
lO 
CO 


CO 
CM 
00 


in 


I 1203| 


o 

CM 


CD 
^* 
CO 


CD 
CM 
CO 


2384 


CO 


to 

s 


I 22331 


co 


00 


1644 


! 1133] 


CO 
CO 
CO 


s 


s 


8 c c 

-o h- to 

CD CO CO 
COLU CQ 
<— i- 

— 1 C 
CO 
CO 
CO 


o 

T- 

CM 


CO 

T— 


CO 

oo 


h- 

CO 


CO 
CM 


CO 
o 

CM 


O 
CM 


m 

CO 

T- 


N. 
CO 
CM 


CO 


CO 
v- 


oo 

CD 


00 


CM 


o 

CM 
CM 


o 
o 

CM 


CM 
CD 
CM 


o 
oo 


§ 


O 
CM 


Funktion 


| " •' unbekannt] 


I mitochondrlales Enzym | 


| identisch zur Ig kappa lelchten Kette variable Region D1 1.| 


| Identlsch zum humanen hsp27| 


Identisch zur Ig schweren Kettevarlablen Region V(2-1) (v(h)-iv 

Famille). 


| unbekanntl 


Identlsch zur Immunoglobulin llelchten Kette variable Region 
(lambda-lilb Untergruppe) von IgM Rheuma-bezogener 

Faktoren. 


| identisch zu US-Patent | 


Identlsch zu einer Immunoglobulin lambda lelchten Kette 

(X57812) 


. identlsch zur humanen hPGi mRNA, die das "bone small 

proteoglycan P kodlert 


| identisch zu patentlerter Sequenz | 


| Identisch zu patentlerter Sequenz 


| identlsch zur CD24 "signal transducer mRNA"| 


Identisch zur humanen Ig schweren Kette variablen Region 

(Klon M49) 


| unbekannt! 


Identlsch zur Ig Alpha 1- Alpha 2m=lmmunoglobulin A1-A2 
lambda hybrid GAU schwere Kette {secreted alpha chain} 


| identisch zu Pro-alpha 2(l) collagen (COL1A2) Gen| 


| Identisch zu humanen NIC (Natural Inhibitor of Collagenase)] 


| Identisch zur humanen Ferritin L Kette] 


Identlsch zum humanen Caicyclin Gen (auch Prolactin- 
Rezeptor assozllertes Protein) 


Wahrscheinlichkelt fiir 
eine spezifische 
Expression Im 
Tumorgewebe % 


' 90.26 1 


90.26| 


o 
o 

T- 


99.76 1 


o 
o 


| 9.3.95| 


' 93.95 


o 
o 

T- 


o 
o 


99.42 


| 99.99! 


I 99.771 


| 99.111 


97.85 


| 94.61 


o 
o 

v— 


! 93.7 1 


99.54 1 


, 99.28] 


97.9 
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TABELLEH 



DNA-Sequenzen 


Peptid-Sequenzen (ORFs) 


Seq. ID. No. 


Seq. ID. No. 


2 


128 






129 






130 




3 


131 






132 






loo 




A 
** 


1o4 






loo 






loo 




e 
o 


137 






138 






139 




7 


140 






141 






142 




B 


143 




9 


144 






.145 






146 




10 


147 






148 






149 




11 


150 






151 






152 




12 


153 






154 






I DO 




13 


156 






157 






158 




16 


159 






160 






161 
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uNA-oequenzen 

Can Ift Kin 


Peptid:Sequenzen (ORFs) 
Seq. ID. No. 


1o 


162 






163 






164 






165 




ZU 


166 






167 






168 




ZO 


169 






170 




ZD 


171 






172 




27 


. 173 




31 


174 




32 


175 






176 






177 




36 


178 






179 






180 




45 


181 






182 






183 





Die crfinderischen Nukleinsaure-Sequenzen Seq. ID No. 1 bis Scq. ID No. 50 der ermittelten Kandidatengene und die 
ermilleilen Aminosaure-Sequenzen Scq. ID No. 51-106 werden in dem nachfolgenden Sequenzprotokoll beschrieben. 

Sequenzprotokoll 

(1 ) ALLGEMEINE INFORMATION: 



(i) ANMELDER: 

(A) NAME: metaGen - Gesellsqhaft fur Genomforschung mbH 
(BJ'&TRASSE: IhnestrassB 63 
(C) STADT: Berlin 

(E) LAND: Deutschland 

(F) POST CODE (ZIP): D-141 95 

(G) TELEFON: (030)-841 3 1 673 

(H) TELEFAX: (030)-8413 1674 
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(ii) TITEL DER ERFINDUNG: Menschliche Nukleinsaure-Sequenzen aus 

Blasentumorgewebe 

(iii) Anzahl der Sequenzen: 106 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1 .25 (EPO) 

(2) INFORMATION UBER SEQ ID NO: 1 : 

(i) SEQUENZ CHARAKTERISTIK: . 

(A) LANGE: 202 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 1 

tgagagtcat ggacctcctg cacaagaaca tgaaacacct gtggttcttc ctcctcctgg 60 
tggcagctcc jcagatgggtc ctgtcccagg tgcagctaca gcagtggggc gcaggactgtl20 
tgaagccttc ggagaccctg tccctaacat gcgctgtctc cggtgactct tccagtacttl80 
actactggga ttggatccgc ca 202 

(2) INFORMATION UBER SEQ ID NO: 2: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1926 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TQPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 
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(iii) HYPOTHETISCH: NE1N 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 2 

ttgcgatggc tgatggactg tggctctcta accaaaggac cctagcgggc tcaacaattg 60 

tcaagagcag ttggtggttc tgaatacaat cctcagccaa ggatccctcc tgtgttacag 120 

atggatcagc taaaacaagc caacactgaa gacacaaaga atgaggttag gttcattgaa 180 

accagggtaa cacctgtgga tgagctaaac acaaagatga caatgacctt gtaccaggta 240 

tagaagctca gagacatgcc tgcaaaatga aatccctgag gaattttgca gctacccaga 300 

gatacgtggt tcaaattaaa atgtctgacg gatcactcat ttgaggaaca gcacatcagc 360 

ttcgcccttt acgtggacaa taggtttttt actttgacgg tgacaagtct ccacctggtg 420 

ttccagatgg gagtcatatt cccacaataa gcagccctta ctaagccgag agatgtcatt 480 

cctgcaggca ggacctatag gcacgtgaag atttgaatga aagtacagtt ccatttggaa 540 

gcccagacat aggatgggtc agtgggcatg gctctattcc tattctcaaa ccatgccagt 600 

ggcaacctgt gctcagtctg aagacaatgg acccacgtta ggtgtgacac gttcacataa 660 

ctgtgcagca . catgccggga gtgatcagtc agacatttta atttgaacca cgtatctctg 720 

ggtagctaca aaattcctca gggatttcat tttgcaggca tgtctctgag cttctatacc 780 

tgctcaaggt cagtgtcatc tttgtgttta. gctcatccaa aggtgttacc ctggtttcaa 840 

tgaacctaac ctcattcttt gtgtcttcag tgttggcttg ttttagctga tccatctgta 900 

acacaggagg gatccttggc tgaggattgt atttcagaac caccaactgc tcttgacaat 960 

tgttaacccg ctaggctcct ttggttagag aagccacagt ccttcagcct ccaattggtgl020 

tcagtactta ggaagaccac agctagatgg acaaacagca ttgggaggcc ttagccctgcl080 

tcctctcaat tccatcctgt agagaacagg agtcaggagc cgctggcagg agacagcatgll40 

tcacccagga ctctgccggt gcagaatatg aacaatgcca tgttcttgca gaaaacgcttl200 

agcctgagtt tcataggagg taatcaccag acaactgcag aatgtagaac actgagcaggl260 

acaactgacc tgtctccttc acatagtcca tatcaccaca aatcacacaa caaaaaggagl320 

aagagatatt ttgggttcaa aaaaagtaaa aagataatgt agctgcattt ctttagttatl380 

tttgaagccc caaatatttc ctcatctttt tgttgttgtc atggatggtg gtgacatggal440 

cttgtttata gaggacaggt cagctgtctg gctcagtgat ctacattctg aagttgtctgl500 

aaaatgtctt catgattaaa ttcagcctaa acgttttgcc gggaacactg cagagacaatl560 

gctgtgagtt tccaacctca gcccatctgc gggcagagaa ggtctagttt gtccatcaccl620 

attatgatat caggactggt tacttggtta aggaggggtc taggagatct gtcccttttal680 

gagacacctt acttataatg aagtacttgg gaaagtggtt ttcaagagta taaatatcctl740 

gtattctaat gatcatcctc taaacatttt atcatttatt aatcctccct gcctgtgtctl800 

attattatat tcatatctct acgctgcaaa ctttctgcct caatgtttac tgtgcctttgl860 

tttttgctag tgtgtgttgt tgaaaaaaaa aacattccct gcctaagtta gttttggcaal920 

agtatt 1926 

(2) INFORMATION UBER SEQ ID NO: 3: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) tANGE: 762 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editieoing 

hergestellte partielle cDNA 
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(iii) HYPOTHETISCH: NEIN 



(iii) ANTI-SENSE: NEIN 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xl) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 3 

ctccactgca accacccaga gccatggctc cccgaggctg catcgtagct gtctttgcca 60 
ttttctgcat ctccaggctc ctctgctcac acggagcccc agtggccccc atgactccttl20 
acctgatgct gtgccagcca cacaagagat gtggggacaa gttctacgac cccctgcagclBO 
actgttgcta tgatgatgcc gtcgtgccct tggccaggac ccagacgtgt ggaaactgca240 
ccttcagagt ctgctttgag cagtgctgcc cctggacctt catggtgaag ctgataaacc300 
agaactgcga ctcagcccgg acctcggatg acaggctttg tcgcagtgtc agctaatgga360 
acatcagggg aacgatgact- cctggattct ccttcctggg tgggcctgga gaaagaggct420 
ggtgttacct gagatctggg atgctgagtg gctgtttggg ggccagagaa acacacactc480 
aactgcccac ttcattctgt gacctgtctg aggcccaccc tgccgctgcc ctgaggaggc540 
ccacaggtcc ccttctagaa ttctggacag catgagatgc gtgtgctgat gggggcccag600 
ggactctgaa ccctcctgat gacccctatg gccaacatca acccggcacc accccaaggc660 
tggctgggga acccttcacc cttctgtgag attttccatc atctcaagtt ctcttctatc720 
caggagcaaa gcacaggatc ataataaatt tatgtacttt aa 762 

(2) INFORMATION UBER SEQ ID NO: 4: 

(j) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 918 Basenpaare 

(B) TYP: Nuklelnsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 
hergestellte partielle cDNA 



(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 4 



ctcgagccgc tcgagccgat tcggctcgag ccttcccgct ccctgcttgc aaagtggttg 60 
tgccccaagg tccgcctcca ggccacgtgg gtgctgcggg ccaagctttc ccttcctttgl20 
agagaggttt ccgctgtagg agcagagctt ccgggctgcg ctcttcgttg cccagtttccl80 
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gctcagtggt 
taggcgccat 
ccgagaagcg 
tcgagcattg 
tgcgcctgga 
cttcgaggtg 
gaaggggccc 
gaagtacctg 
tgatgttgga 
gtttcttccc 
aaaagttact 
gaaaaaagaa 
gccgcggggc 



cgcgtctccg 
ggctccccgc 
agagaagctg 
cactagctga 
ggccccagag 
acgctgctgc 
ccacgcaaac 
tcgtagggag 
acattaatga 
tgccagaaat 
ggatttggtt 
aaagaaaaaa 
gcggcggc 



ccccccaccc 
gggaggaagc 
gcgaacggcg 
cgcgtctatg 
cttccagtaa 
gcccggacgg 
tcaaattccc 
atttgggtag 
tggaacatgg 
gaaggttcag 
taataaaagt 
agaaagaaga 



accagtcccg^ 
gtaaggctga 
gggagggaat 
ggcgcaacgc 
aggtgaaccc 
cagcagtgcg 
tgagcctcaa 
aagccctcat 
ccaaacttca 
ttatgaggca 
tgaaataaag 
aaaaagaaag 



ctgcattctc 
ggccc,C£gtg 
ggaggaggcg 
cgcggccctg 
gacgaagccc 
gaactctgga 
gaggtggtgg 
gctgagcttt 
gtcatgatcc 
accctctagt 
taaaagaaaa 
gagaagcgag 



ggcpgggctc2<0 
gtcgccgtagSGO 
accgttgtta360 
agccaggcgc420 
cggagggcag480 
ctgggattaa540 
aagagttgaa600 
gtgtccctgg660. 
tgaagccatg720 
aaggcattgt780 
aaaaaaaaga840 
agaaagggag900 
918 



(2) INFORMATION OBER SEQ ID NO: 5: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1 146 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 5 

tcagtttagt ggcagggtgg ttttttaatt ttcttctgtg gctggatttt tgttgtttgt . 60 

ttaaataact cttctgggaa gttggtttat aagcctttgc caggtgtaac tgttgtgaaa 120 

tacccaccac taaagttttt taagttccat attttctcca ttttgccttc ttatgtattt 180 

tcaagattat tctgtgcact ttaaatttac ttaacttacc ataaatgcag tgtgactttt 240 

cccacacact ggattgtgag gctcttaact tcttaaaagt ataatggcat cttgtgaatc 300 

ctataagcag tctttatgtc tcttaacatt cacacctact ttttaaaaac aaatattatt 360 

actattttta ttattgtttg tcctttataa attttcttaa agattaagaa aatttaagac 420 

cccattgagt tactgtaatg caattcaact ttgagttatc ttttaaatat gtcttgtata 480 

gttcatattc atggctgaaa cttgaccaca ctattgctga ttgtatggtt ttcacctgga 540 

caccgtgtag aatgcttgat tacttgtact cttcttatgc taatatgctc tgggctggag 600 

aaatgaaatc ctcaagccat caggatttgc tatttaagtg gcttgacaac tgggccacca 660 

aagaacttga acttcacctt ttaggatttg agctgttctg gaacacattg ctgcactttg 720 

gaaagtcaaa atcaagtgcc agtggcgccc tttccataga gaatttgccc agctttgctt 780 

taaaagatgt cttgtttttt atatacacat aatcaatagg tccaatctgc tctcaaggcc 840 

ttggtcctgg tgggattcct tcaccaatta ctttaattaa aaatggctgc aactgtaaga 900 

acccttgtct gatatatttg caactatgct cccatttaca aatgtacctt ctaatgctca 960 

gttgccaggt tccaatgcaa aggtggcgtg gactcccttt gtgtgggtgg ggtttgtgggl020 

tagtggtgaa ggaccgatat cagaaaaatg ccttcaagtg tactaattta ttaataaaca!080 
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ttaggtgttt gttaaaaaaa aaaaaaaaaa aaaaaaatag; ggtgggt^gt atttjc,ttgtall40 
ttggtt ' 1146 

(2) INFORMATION UBER SEQ ID NO: 6: 

(i) SEQUENZ CHARAKTERISTIK: 
(A) LANGE: 2407 Basenpaare 

' (B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOlTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 6 

gagtgagtga gtgtgttgca tcgaattaag gactcttgaa gagaagagag gtccattcag 60 
ggttgtccag attgaagtga ggtctcacgg tgaaaagaaa aggaaaatat tcagactctc 120 
ttgaaatcca aagagcaaga agtaaatgaa cttctgcaaa aattccagca agctcaggaa 180 
gaacttgcag aaatgaaaag atactctgag agctcttcaa aactggagga agataaagat 240 
aaaaagataa atgagatgtc gaaggaagtc accaaattga aggaggcctt gaacagcctc 300 
tcccagctct cctactca^c aagct<?atcc aaaaggcaga gtcagcagct ggaggcgctg 360 
cagcagcaag tcaaacagct ccagaaccag ctggcggaat gcaagaaaca acaccaggag 420 
gtcatatcag tttacagaat gcatcttctg tatgctgtgc agggccagat ggatgaagat 480 
gtccagaaag tactgaagca aatccttacc atgtgtaaaa accagtctca aaagaagtaa 540 
agtggattcc ttggcaggac actgcccctt gtcatctgtc tttgtgttag atccagagtt 600 
gtcggcagcc gctgccattg ttctcattcg tggtatgcac tgtggcctag cgtagttctt 660 
ccctttccaa aggtttctga ggacttctcc caggagaaga ctgcccgcct cagaactgct 720 
tagagacttc aaaccagcag aggtgaaagt ccctgtcatc ccttcagatt ccagagctgg 780 
gatcagccat gcccagaggt ctggtcctga tgctggcagg ggggccccct cctccatccc 840 
tgactggctg agtggcttta tcaccaccga gtgatgtgct gaggcctcct gcagtgaatg 900 
ctccttccat tcctgtactc gggcagtgcc attcagcaca ggagagctct ttttgccttt 960 
ggctttcaat tccaaaacat gatttaattt ctaactaaat tagtatggca ctagttatgal020 
agtatctgct taaaaccqt^t catcatgata tcctgtggat ttaaaaactc taattccatgl080 
ttttcttccc atctgcctta tatatctcat caccctgctt atcaatattc agtttgatgall40 
gcactattaa ctaaaatatg* aaacttaaaa acaaaagcaa gttgtcctta aaagttctttl200 
ttttaagtaa attgttgaca tactgcaaat tttctatgca aacttgcctc ctgctgttatl260 
ctgtgaagct caggaaatcc aaacatttgt gtttcaacaa gggacagtaa actgtgtgttl320 
tacagccaaa agaaatgcct catagttctt aacctcaact tttgtagaag tatttttttcl380 
tctgtaatat ttttattggc tcataaagat gttttcatat ctgaactcct aaataagtgal440 
aattacagta gattatatta acaaaatact ttttaggtag ccatgcttga gactttttaalSOO 
aaatataact ttttccttaa agttttcagc tatagcaaaa ggtagttatg tatgccagacl560 
ctaatatgag ctgccaccaa cacccctaga actttcagcc atggtgtctt cagaattgtal620 
gcgcatttct gaatctagca aatcctcctt ttacccgttg aatgttttga atgccctgacl680 
tctaccagcg cccataaatg atctctagaa ggactgttag taccaatctg tttttcaact!740 
ttgaagctaa aaaccctgat atggtaatat tatggtgcat agcagaggtc tcggaaaaaal800 
aatatttctg ttcactttac tttcaggtta aaaatgtttc taacacgctt gcaacttcccl860 
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ttatggcatt 
ctgaaagtag 
gtgcttggct 
gcaaaaatgg 
atttcaaaaa 
cacaacagcc 
ttttgtagga 
cgagtactct 
taatttacag 
atgaaaa 



aatcttgttg 
ggcctgctct 
gtgctggatg 
ctttaaagct 
tgctccaaag 
ccttctgccc 
tgcctgacga 
ggttaaatat 
aagttattgg 



agggagagag 
gacagggccq 
atatgttgat 
tggtgttact 
aaatgtgaaa 
ccgcacagaa 
ggtgtagcct 
tgaaaagtta 
agaaaataaa 



acagaatcct 
atgt^ccacff 
ctgtattgga 
tttcttaagt 
ggaccttttg 
atgctgcaga 
tttatcttgt 
tatgctgtag 
cttgtttcat 



ggactctcca 
* eg get get t 3 
taaggaccaa 
tgtttaatta 
tcacagcact 
gtatataaaa 
ttccggatgc 
tttttagtat 
tttgcaaaaa 



aa-^tatttcal923 
gcctcagtygl330 
tgacagcaaa2040 
tagttaagca2100 
tcagaaaata2160 
cttgagacat2220 
atatttatta2280 
tttgtctttg2340 
aaaaaaaaaa2400 
2407 



(2) INFORMATION UBER SEQ ID NO: 7: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1471 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 



(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 7 

ctcgtgcaac ccggcggctc ctgcagcggt ggtcggctgt tgggtgtgga gtttcccagc 60 
gcccctcggg tccgaccctt tgagcgttct gctccggcgc cagctacctc gctcctcggc 120 
gccatgacca caaccaccac cttcaaggga gtcgacccca acagcaggaa tagctcccga 180 
gttttgcggc ctccaggtgg tggatccaat ttttcattag gttttgatga accaacagaa 240 
caacctgtga ggaagaacaa aatggcctct aatatctttg ggacacctga agaaaatcaa 300 
gcttcttggg ccaagtcagc aggtgccaag tctagtggtg gcagggaaga cttggagtca 360 
tctggactgc agagaaggaa ctcctctgaa gcaagctccg gagacttctt agatctgaag 420 
ggagaaggtg atattcatga aaatgtggac acagacttgc caggcagcct ggggcagagt 480 
gaagagaagc ccgtgcctgc tgcgcctgtg cccagcccgg tggccccggc cccagtgcca 540 
tccagaagaa atccccctgg cggcaagtcc agcctcgtct tgggttagct ctgactgtcc 600 
tgaacgctgt cgttctgtct gtttcctcca tgcttgtgaa ctgcacaact tgagcctgac 660. 
tgtacatctc ttggatttgt> ttcattaaaa agaagcactt tatgtactgc tgtctttttt 720 
ttttttcttt tgaagaacag gtttctctct gtccttgact cttgggtctg tgggccatgg 780 
catgagtgtt ttctagtagt agattggagg gaaagctttg tgacacttag tactgtgttt 840 
ttaagaagaa ataatttggt tccagatgtg ttagaggatc ttttgtactg aggtttttaa 900 
cactttactt gggtttacca agcctcaact ggacagacca taaacagtcc acaggcaccg 960 
ttcctgccag gccccaaccc acagggagtc tctccgcaga gccttcttgg tgttgccctal020 
acttgccagt ggcctttgct cagagcctcc tcctgtgaca tgtgaacaat gaagaggcctl080 
gcgcctcctg ccttgccgcc tgcaaagcaa agaaactgcc ttttattttt taaccttaaail40 
aagtagccag atagtaacaa gactggctgg ctgatgagca aagcctttgc tctcacgcagl200 
aggaaggctt ggatgtacaa tgaaactgcc tggaactaaa agcagtgaag caagggaggcl260 
aatcacactg aagcgggtct tcctccagga acggggtccc acaggcgtgt tgttttaaatl320 
aacctgatgc tgtgtgcatg atgctggtgc ttgaccatga aaggaaagtc tcatccttaal380 
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aatgtgttgt acttcacaat cctggactgt tgcttcaagt aaaccatatc cacattttgal4 40 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a • 1471 

(2) INFORMATION UBER SEQ ID NO: 8: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1 732 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Edrtierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 8 



gcagaaccta cgcctgacgg gcccggcggc ggctgagccg cgctgcgcag cgacgcggga 60 
atgaagcggg cgctgggcag gcgaaagggc gtgtggttgc gcctgaggaa gatacttttc 120 
tgtgttttgg ggttgtacat tgccattcca tttctcatca aactatgtcc tggaatacag 180 
gccaaactga ttttcttgaa tttcgtaaga gttccctatt tcattgattt gaaaaaacca 240 
caggatcaag gtttgaatca cacgtgtaac tactacctgc agccagagga agacgtgacc 300 
attggagtct ggcacaccgt ccctgcagtc tggtggaaga acgcccaagg caaagaccag 360 
atgtggtatg aggatgcctt ggcttccagc caccctatca ttctgtacct gcatgggaac 420 
gcaggtacca gaggaggcga ccaccgcgtg gagctttaca aggtgctgag ttcccttggt 480 
taccatgtgg tcacctttga ctacagaggt tggggtgact cagtgggaac gccatctgag 540 
cggggcatga cctatgacgc actccacgtt tttgactgga tcaaagcaag aagtggtgac 600 
aaccccgtgt acatctgggg ccactctctg ggcactggcg tggcgacaaa tctggtgcgg 660 
cgcctctgtg agcgagagac gcctccagat gcccttatat tggaatctcc attcactaat 720 
atccgtgaag aagctaagag ccatccattt tcagtgatat atcgatactt ccctgggttt 780 
gactggttct tccttgatcc tattacaagt agtggaatta aatttgcaaa tgatgaaaac 840 
gtgaagcaca tctcctgtcc cctgctcatc ctgcacgctg aggacgaccc ggtggtgccc 900 
ttccagcttg gcagaaagct ctatagcatc gccgcaccag ctcgaagctt ccgagatttc 960 
aaagttcagt ttgtgccctt tcattcagac cttggctaca ggcacaaata catttacaagl020 
agccctgagc tgccacggat actgagggaa ttcctgggga agtcggagcc tgagcaccagl080 
cactgagcct. ggccgtggga aggaagcatg aagacctctg ccctcctccc gttttcctccll40 
agtcagcagc ccggtatcct> gaagccccgg ggggccggca cctgcaatgc tcaggagcccl200 
agctcgcacc tggagagcac ctcagatccc aggcggggag gcccctgcag gcctgcagtgl260 
cccggaggcc tgagcatggc tgtgtggaaa gcgtgggtgg caggcatgtg gctctccttgl320 
ccgcccctoa acctgagatc ttgttgggag acttaatggc agcaggcagc catcactgccl380 
tggttgatgc tgcactgagc tggacagggg gagtccgggc aggggactct tggggctcggl440 
gaccatgctg agctttttgg caccacccac agagaacgtg gggtccaggt tctttctgcalSOO 
ccttcccagc acatgcagaa tgactccagt ggttccatcg tcccctcctg ccctgtgtacl560 
ctgcttgcct ttctcagctg ccccacctcc cctgggctgg cccactcacc cacagtggaal620 
gtgcccggga tctgcacttc ctcccctttc acctacctgt acacctaacc tggccttagal680 
ctgagcttta tttaagaata aaatcgtggt ggtggtcaaa aagacactct gc 1732 
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(i) SEQUENZ CHARACTERISTIC 

(A) LANGE: 989 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 9 



cggctcgagc 
gccctggctt 
ggtcctggca 
caccaggtac 
acggctgacg 
atcctctcca 
cagctggggc 
agccccttgc 
ggctggcggg 
gtgcacctgc 
tccctctcag 
atgagctccc 
atggagtcac 
ccctggcccg 
ggctcagggc 
ttggctgttt 
aaaaaaaaaa 



gtgatcgtcg 
cttggcgcga 
gaaatcagca 
taaaggagct 
ccaagtttga 
gtgcggccaa 
tgcccaaaga 
agaagcactt 
tggactacac 
ggctggaggt 
cagacaagtt 
tgggctgagg 
gccctctgaa 
gactctggcc 
tcctgaggac 
tcataagcag 
aaaaaaaaaa 



actcagctga 
tgaggttccg 
cgctggccaa 
gctgggacag 
gtcaggcgat 
gcacagtgtc 
gcacgcggcc 
gcgggtctgc 
cctgagctcc 
ggcagctgcc 
ccaggtcctc 
agaagggtgt 
ctgctcttcg 
tcccagatcc 
ctttcccagc 
gaaaaataaa 
aaaaaaaaa 



ccctgcggga 
gttctgtggt 
gatgtcctct 
gggattgatt 
gtgaaggcca 
gatggcgaat 
agcctgtgcc 
agcctacgca 
agcctgctgc 
ccagggaccc 
ctggcagaac 
tccaggcctg 
ggaggcagcc 
ccagctgcct 
attaccttcc 
cagaagtata 



ccggaaaa.ag 
gatctggact 
gtgaagttgc 
atgagaagat 
cagtggcagt 
ccttgtccag 
gctgttatga 
tgaataggtt 
aatccgtgga 
cagcccagcc 
tgaagcaggc 
tgtggagccg 
ctggttctag 
cacttctctc 
cttcccttga 
aaagagaaaa 



aaattcccgg 60 
gtcccgactgl20 
ggctgctctglBO 
cctgaagctc240 
gctgagtttc300 
tgaactgcag360 
ggagaagcaa420 
ggcaggtgtg480 
agagcccatg540 
tgtt gccat g60 0 
ccagaccctg660 
ccctgcccgt720 
gatgctgagg780 
ttgagaactt840 
aaggcaattg90 0 
aaaaaaaaaa960 
989 



(2) INFORMATION UBER SEQ ID NO: 10: 



(i) SEQUENZ CHARAKTERISTIK: : 

(A) LANGE: 150 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
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(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 10 

attttatgaa gttgaaaaat agctcacttt aaagctagtt ttgaagacgt gcagctgtga 60 
cttgggtctg gttgggggtg ttgtgttttg agtcagccgt tttcactccc actgaggttgl20 
tcagaacatg cagattgctt cgattttctc 150 



(2) INFORMATION UBER SEQ ID NO: 11: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1467 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs (lurch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 1 1 

cgaggaccgg ccttgcgagc ggcgacgact ataaaatggc gcgtgctgca acccgcgccc 60 

gcttcggaga gagaaatgct ggggtgcagc ttcaagctta ggaccaccca ccatgcctat 120 

ccaggtgctg aagggcctga* ccatcactca ttaagaacag aggaggctgc ctgttactcc 180 

tggtgttgca tccctccaga cactctgctg tttcctgcct aggcgtggct gcagccatgg 240 

ctaggaaagc gctgccaccc acccacctgg gccagagctg gttctgctcc tgctgcaggg 300 

acactgagot ggctatctcg gcgcttcggg caagaactgc aacaggctct cctgggtcct 360 

gcaggtgtac agccgggccc ctgccttgtg cctcagctct cgagagctgc tgctgccggg 420 

tgacctgatc caacctgata aggtgccatc ttcagctacc actgcaaggc cctgagggca 480 

acagcagcac ggcactgccc acccggctgc tgatggcctg gtgccagctg ggagtcctcc 540 

cggcacttcg aggccactga gccacccttc cagccccagc ccaccatgga caggggtatc 600 

cagcttcctc ctcaacctcg tcctctgccc ctgagccagt gacgcccaag gacatgcctg 660 

ttacccaggt cctgtaccag cactagctgg tcaagggcat gacagtgctg gaggccgtct 720 

tggagatcca ggccatcact . ggcagcaggc tgctctccat ggtgccaggg cccgccaggc 780 

caccaggctc atgctgggac ccaacccagt gcacaaggac ttggctgctg agccacacac 840 
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ccaggagaag gtggataagt gggctaccaa gggcttcstg caggctaggg gaggagcc3c .900 

ccccgcttcc ctattgtgac caggcctatg gggaggagct" gtccatacgc caccgtgaga SCO 

cctgggcctg gctctcaagg acagacaccg cctggcctgg tgctccaggg gtgaagcaggl020 

ccagaatcct gggggagctg ctcctggttt gagctgcatt caggaagtgc gggacatggtl080 

aggggaggca aaaagccttg ggcactaccc tccctgtgga gctgttcggt gtccgtcgagll40 

ctagccacac cctgacacca tgttcaaggg taccggaaga gaagggtgtc tgcccccaacl200 

ctcccctgtg ggtgtcactg gccagatgtc atgagggaag caggccttgt gagtggacacl260 

tgaccatgag tccctggggg gagtgatccc ccaggcatcg tgtgccatgt tgcacttctgl320 

cccaggcagc agggtgggtg ggtaccatgg gtgcccaccc ctccaccaca tggggccccal380 

aagcactgca ggccaagcag ggcaacccca cacccttgac ataaaagcat cttgaagcttl440 

ttaaaaaaaa aaaaaaaaaa aaaataa 1467 

(2) INFORMATION ClBER SEQ ID NO: 12: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 895 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 12 

ctcgagccgc tcgagaacct cacttcctta ctcctccaaa aagaagtggg gaaagaacca 60 

tcaaaccttt cctcctgact taccaaacca ggaaaacagc aggagagggt ggctcaggacl20 

ttagggacag ggtatagctt agatggtgga aagcaaagga gagcaggaag ttgtaaatcal80 

ctggctaatg agaaaaggag acagctaact ctaggatgaa gctgtgacta ggctggagtt240 

gcttccttga agatgggact ccttgggtat caagacctat gccacatcac actggggcta300 

gggaagtagg tgatgccagc cctcaagtct gtcttcagcc agggacttga gaagttatat360 

tgggcagtgg ctccaatctg tggaccagta tttcagcttt ccctgaagat caggcagggt420 

gccattcatt gtctttctct cctagccccc tcaggaaaga aggactatat ttgtactgta480 

ccctaggggt tctggaaggg aaaacatgga atcaggattc tatagactga taggccctat540 

ccacaagggc catgactggg aaaaggtatg ggagcagaag gagaattggg attttagggt600 

gcagtacgct caccctaaao ttttggtggc ctggggcatg tcttgaggcc cagactgtta660 

agcaggctct gctggcctgt ttactcgtca ccacctctgc acctgctgtc ttgagactcc720. 

atccagcccc aggcacgcca cctgctcctg agcctccact atctccctgt gacgggtgaa780 

cttcgtgtac tgtgtctcgg gtccatatat gaattgtgag cagggttcat ctattttaaa840 

cacagatgtt tacaaaataa agattatttc aaaccaaaaa gaaaaaaaaa aaaaa 895 



(2) INFORMATION UBER SEQ ID NO: 13: 

(i) SEQUENZ CHARAKTERISTIK: 
(A) LANGE: 467 Basenpaare 
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(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 1 3 

acccagcagc cctcgcgcgg tccggcacag cggacaccag gactccaaaa tggcgtcagt 60 
tgtaccagtg aaggacaaga aacttctgga ggtcaaactg ggggagctgc caagctggatl20 
cttgatgcgg gacttcagtc ctagtggcat tttcggagcg tttcaaagag gttactaccgl80 
gtactacaac aagtacatca atgtgaagaa ggggagcatc tcggggatta ccatggtgct24 0 
ggcatgctac gtgctcttta gctactcctt ttcctacaag catctcaagc acgagcggct300 
ccgcaaatac cactgaagag gacacactct gcaccccccc accccacgac cttggcccga360 
gcccctccgt gaggaacaca atctcaatcg ttgctgaatc ctttcatatc ctaataggaa420 
ttaacctcca aataaaacat gactggtacg tgtaaaaaaa aaaaaaa 467 



(2) INFORMATION UBER SEQ ID NO: 14: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 511 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN . 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) DRGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 14 
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actagttcta gatcgcgagc 
tctttttttt ttttttgtgg 
gggaggagag cgtgtgcggc 
ctgcctctgg cttctcaggc 
agctgcagcc catcctcccg 
tcagagacaa cttcccaaag 
agactctgta cctattttgt 
gctggaataa agcatgtgga 
aaaaaaaaaa aaaaaaaaaa 



ggccgccctt tttgttcaaa 
atggggactt gtgaattttt 
tccagcccag cccgctgctc 
ctctgctctc cgacctctct 
gcgccctcct agtctgtcct 
cacaaagcag tttttccccc 
atgtgtataa taatttgaga 
aatgacccca aacataaaaa 
aaaaaaaaaa a 



gtctattttt attctctttt 60 
ctaaaggtgc tatttaacatl20 
actttccacc ctctctccac!80 
cctctgaaac cctcctccac240 
gcgtcctctg tccccgggtt300 
taggggtggg aggaagcaaa360 
tgtttttaat tattttgatt420 
aaaaaaaaaa aaaaaaaaaa480 

511 



(2) INFORMATION UBER SEQ ID NO: 15: 

(i) SEQUENZ CHARACTERISTIC 

(A) LANGE: 1899 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblieaing und Editierung 

hergestellte partielle cDNA 



HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 15 

tctccaccct ccccttcctt ctttcttttt ccccttcttg cacgtggatc actcaggcct 60 

cagaatgagg ctgqtttatt ggaagctatt ctgacatcac tttccagact gtctcactgt 120 

cttgggacca ggcatgggag scaggggtgg gaatcttctt gtgattgtgg gtggtggctg 180 

gaggagtgga gtggtgggag gcggctcagt cgcgggcact cctgcgactg gcggatgcgg 240 

tccggatgga ataagccttc aggagcccag gacccgcact gctggagaag ctcagggcat 300 

tgctgcccat ggttcccccg agggtcagcc caatgccacc gccactgcta ctgccaccag 360 

tggaattcat cacagagata ttcacggctc ccactccatc tccagccaac cggctctcct 420 

cgccctccag cagcttgcgg taggtggcga tctcgatgtc cagggccagc ttcacgctca 480 

tgagttcctg gtactcacgc agctgccgtg ccatatcctg cttggcccgc tgcagggcgg 540 

cttccagctc ctcctgcttg, gcacgagcat ccttgagcgc cagctcccca cgctcctcag 600 

cctcggcaat ggcggcctcc aacttggcac gctggttctt gatgttgtcg atctcagcct 660 

gcagcctctg gatggcccgg ttcatctctg aaatctcatt cc^ggtattc cggaggtcgt 720 

ccccatgctt cccagcctgg gcctggaggg tctcaaactt ggtctggtac caggcttcag 780 

cctcagcccg gctgcatttg gccatctcct catactgcgc cttgacctca gcgatgatgc 840 

cgtccaggtc cagggagcga ctgttgtcca tggacagcac cacagatgtg tcggagatct 900 

gggactgcag ctctgtcaac tccgtctcat tgagggtcct gaggaagttg atctcatcat 960 

tcagggcatc caccttggcc tccagctcca ccttgctcat gtaggcagca tccacatcctl020 

tcttcagcac cacaaaetca. ttctcagcag ctgtgcggcg gttaatttca tcttcgtactl080 

tattcttgaa gtcctccacc acatcctgcc atctccgcag ctccgcctcc aggcggccccll40 

catccacctg cagtgcctca agctgacccc gaaggccagc aatctgggcc tcaaagatgtl200 

ctgggaggcg gctgctcttg gccgacttct gctcctgcag cagcgtccac ttggtctccal260 
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gcagcttgtt ctgctgctcc agaaaccgca cctt^tcgat .gaaguaggca aacttgttgt3 320 
tgagggtctt gatctgctcg ctctcctcct ggcgcacccg" ctggagggay gggtcggcgtljtfO 
ccagccgcag cggggccagc aggctctggt taatggtgac ctcgcggatg ccggcgcccal440 
. ccgggccccc ataggcagag cgcacggcca cgcgcggccg tgaggcgccg aggccgtagalSOQ 
ggctgctgct gccaaggccg ccggggcgag cggagctcag gcgcacctgg gcgccgcggcl560 
ccgagaaggc ggctgagcgc gaggtgaata ccggggagct gaagtggatg gacatggtggl620 
ctgggccggg atggacctag cggcgggcga ggaggagcgc gcactcgctg acctcggggal680 
cactccgcac cttttatccg cgggagccgg tgctgggctt ccacaggtag gggcggggctl740 
ggccgcgggc accgtttctc tgctgccagg cccctcctgc gcgtccgtcc gccctctgccl800 
cgcccgcccc gccgaagccc aggctttcag tccaagcagg gatggtccgg agtaggcaggl860 
agcgccatcc ctagacggcc gcagagaaca gcgggggac 1899 

(2) INFORMATION UBER SEQ ID NO: 1 6: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 758 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblieaing und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 1 6 

cggctcgagc ggctcgaggg gcaagaagaa patcacgtgc tgtgacaccg acttgtgcaa 60 
cgccagcggg gcccatgccc tgcagccggc tgccgccatc cttgcgctgc tccctgcactl20 
cggcctgctg ctctggggac ccggccagct ataggctctg gggggccccg ctgcagcccal80 
cactgggtgt ggtgccccag gcctctgtgc cactcctcac agacctggcc cagtgggagc240 , 
ctgtcctggt tcctgaggca catcctaacg caacptctgac catgtatgtc tgcacccctg300 
tgccccaccc tgaccctccc atggccctct cbaggactcc cacccggcag atcagctcta360 
gtgacacaga tccgcctgca gatggcccct ccaaccctct ctgctgctgt ttccatggcc420 
cagcattctc cacccttaac cctgtgctca ggcacctctt ccccfcaggaa gccttccctg480 
cccaccccat ctatgactt;g agccaggtct ggtccgtggt gtcccccgca cccagcaggg540 
gacaggcact caggagggcc cagtaaaggc tgagatgaag tggactgagt agaactggag600 
gacaagagtc gacgtgagtt> cctgggagtc tccagagatg gggcctggag gcctggagga660 
aqgqqocagg cctcacattc gtggggctcc ctgaatggca gcctgagcac agcgtaggcc720 
cttaataaac acctgttgga taagccaaaa aaaaaaaa 758 



(2) INFORMATION UBER SEQ ID NO: 17: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 302 Basenpaare 

(B) TYP: Nukleinsaure 
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(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte parb'elle cDNA . 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 17 

cggctcgacg gtctcgagat caagcatgaa ttgcaagcaa actgctacga ggaggtcaag 60 
gaccgctgca ccctggcaga gaagctgggg ggcagtgccg tcatctccct ggagggcaagl20 
cctttgtgag cccctttctg cgcccccttg cctgggagca tctgggcagg ccccaacaccl80 
ttgccctttg ggggtttgca gggctcgccc cctttcctgg ccagaaccgg gagggggctg240 
gggggggatt cccaggcagg gggggagggg ccaattccct tttcaacccc caggttgggc300 
ca 302 



(2) INFORMATION UBER SEQ ID NO: 1 8: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 824 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte parb'elle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 18 

ggcgaggaca gaggaggcgc gtccggcctt cctgaacacc ttaggctggt ggggctgcgg 60 
caagaagcgg gtctgtttct ttacttcctc cacggagtcg gcacactatg gctgcccctgl20 
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ggctcccaga acccacaaca tgaaagaaat ggtgccaccc ?gctcaagcc tgg^cctttg3 80 
aatccggaca caaaaccctc tagcttggaa atgaatatgc tgcsctttac aaccactgca2£0 
ctacctgact caggaatcgg ctctggaagg tgaagctaga ggaaccagac ctcatcagcc300 
caacatcaaa gacaccatcg gaacagcagc gcccgcagca cccaccccgc accggcgact360 
ccatcttcat ggccaccccc tgcggcggac ggttgaccac cagccaccac atcatcccag420 
agctgagctc ctccagcggg atgacgccgt ccccaccacc tccctcttct tctttttcat4B0 
ccttctgtct ctttgtttct gagctttcct gtctttcctt ttttctgaga gattcaaagc540 
ctccacgact ctgtttcccc cgtcccttct gaatttaatt tgcactaagt catttgcact600 
ggttggagtt gtggagacgg ccttgagtct cagtacgagt gtgcgtgagt gtgagccacc660 
ttggcaagtg cctgtgcagg gcccggccgc cctccatctg ggccgggtga ctggggcgcc720 
ggctgtgtgc ccgaaggcct caccctggcc cttcggcctt agtctgggaa ggttccgaac780 
cgaacatcaa gggaggcaag cctttcaagg catttccatt aatt 824 



(2) INFORMATION UBER SEQ ID NO: 19: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 2190 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) S EQU ENZ-B ES CH REI BUNG: SEQ ID NO: 19 

tttttttttt tttttttttt tttttttttt tttttttttt 

tattgggatt gcaagcgtta caaggttaaa gacaaaaccc 
aatattagcg ttaaaggagc tgagttgagt caaacacggg 
..gcaggcacag gtgacattca gtgtttggcg tgggggtctt 
gacccaagag ggggcccccc actgaagaca ttggggacac 
agccacgact aggcacggag gtcagacagg cagcccgggc 
gggagagctg caaacctggg gacgcaaggg gctggtcggc 
actccggcga ggcagaatatf aacactgggt gggtgggtgt 
tttggggtgc ctcgaagcgt tttggatctc aggccaatgt 
tttggctctt tgggcttctg tccaatgttc cgaatggccc 
gggaccctTct gcgacgctcg agctcgagcc gaaatgaggg 
attgatctat tccccctctt tgtttcttgg ggcagttttt 
tatgcgctca gaaaccaaat taaacccccc ccccatgtaa 
caagaacgca cgaagccagc ctggagacca ccacgtcctg 
tgattggatt ttgtttttca tctgtccctg ttgcttgggt 
ctggggggca ctggccactg agcccccttg gagaagtcag 
tgtccggcct ggcttctggg gacagtggct ggtccccaga* 
gggttgggca gggtctcctc aggtgtcagg agggtgctcg 
ctggctggcc tgaggctggc cggaggggaa ggggctagca 

74 



tttcatgact tttaatgctt 60 
aagcatggga ttttgccgga 120 
ccgcaaggtg gaccgaggcg 180 
caggtgatgg cagaggaggg 240 
ggggaggaga caagatggag 300 
caggatggtt agtggcccag 360 
aagtgccccc gggaacaccc 420 
cctgacgaat gggcaggtaa 480 
gggttccaca attgtgacaa 540 
actcacaggg cgcttgccga 600 
aacccccaaa tttcatgtca 660 
tttttacccc tccttagctt 720 
caggggggca gtgacaaaag 780 
ccccccgcca tttatcgccc 840 
tgagttgagg gtggagcctc 900 
aggggagtgg agaaggccac 960 
agtcctgagg gcggagggggl020 
gaggccacag gagggggctcl080 
ggtgtgtaaa cagagggttc!140 
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catcaggctg 
ggtgacatct 
gggacctctt 
gcccacagag 
cccacgggtc 
ctcggcctct 
gcagcccgac 
gaccccgggg 
ctgtgagtgg 
tggaacccac 
tcgtcaggac 
gggcacttgc 
ccatcctggc 
tcctccccgt 
atcacctgaa 
ccttgcggcc 
ccatgcttgg 
tcatgaaatt 



gggcagggtg 
tgcccgcccc 
gggggcaagg 
caggagagct 
tggtgatgcc 
gggaggttta 
tagcagtcta 
gtcgtccatg 
gccattcgga 
attggcctga 
acccacccac 
cgaccagccc 
ccgggctgcc 
gtccccaatg 
gacccccacg 
cgtgtttgac 
gttttgtctt 
caaaaaaaaa 



gccgccttcc 
tcagcaccct 
tggggtgagg 
gccaggtctg 
atagcagcca 
cctcgccccc 
gaggcctgag 
ccagtccgcc, 
acattggaca 
gatccaaaac 
ccagtgttat 
cttgcgtccc 
tgtctgacct 
tcttcagtgg 
ccaaacactg 
tcaactcagc 
taaccttgta 
aaaaaaaaaa 



gcaoictcga 
gcctcgcctc" 
ccggggagta 
cccatcgacc 
ccaccgcggc 
acttgtgccc 
gcttctgggt 
tcagtcgcag 
gaagcccaaa 
gcttcgaggc 
attctgcctc 
caggtttgca 
ccgtgcctag 
ggggccccct 
aatgtcacct 
tcctttaacg 
acgcttgcaa 



ggaaccctc-.c 
caggaggtcc 
gggaggtcag 
aggttgcttg 
gcctagggct 
ccagctcagc 
cctggtgacg 
agggtccctc 
gagccaaatt 
accccaaatt 
gccggagtgg 
gctctcccct 
tcgtggctct 
cttgggtccc 
gtgcctgccg 
ctaatatttc 
tcccaataaa 



cccctccctcl20*) 
gaagctctgtl260 
gcgggtctgal320 
ggccccggagl380 
gcggcagggal4 4 0 
ccccctgcaclSOO 
gggctggcatl560 
ggcaagcgccl620 
gtcacaattgl680 
acctgcccatl740 
gtgttcccggl800 
gggccactaal860 
ccatcttgtcl920 
ctcctctgccl980 
cctcggtcca2040 
cggcaaaatc2100 
gcattaaaag2160 
2190 



(2) INFORMATION OBER SEQ ID NO: 20: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 2565 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzei 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblieaing und Editierung 

hergestellte partielle cDNA • 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 20 

ctcccccacc tgtggcccgc aagccgtctg tgggagtccc ggaccccgcc tcccccagtt 60 

accctcgagc tgagcccctt actgctcctc ccaccaatgg gctccctcac acccaggaca 120 

ggactaagag ggagctggcg^ gagaatggag gtgtcctgca gctggtgggc ccagaggaga 180 

agatgggcct cccgggctca gactcacaga aagagctggc ctgaccacca ggcacctcac 240 

tggcactgct gacccatccc agaaacacaa tctcagggac ccgagcagct ccaaggacga 300 

gaggatacag cagacacaac ctaatagaga gggcgcctgc agccttaacc tccacggcct 360 

tcgatactta tgcaagcctg gtgttgctcc tgtcctcaga gtcatcctgc gctcatgcct 420 

tttcccgaat gggttcacct ctggcagttg ccgcttcagt cttggcctta gcctcatctt 480 

gaagtgggta gctggcggga gagggtggct gcgccccctg ctggccctga ggctgcagag 540 

ttgggagcag gacacctcac ctgagtttca ttttttttca tgtccaaacc atgcacatac 600 

tatagtccag aatcaaagca cttttgaaaa gtggctgcat ggccatcctc cagggcccag 660 

gaagttgcat tccaagggcc tgtttacatg gcagcagaat ccatccccgg cagtcagccc 720 

atagcttggg accagtctgt gccctcctgc ccagtccagt ttactcctct tggttcctga 780 

aggtggccaa gtcattgtgt tcccacaggc ttctctaggc tgggggcagg tgtggggctg 840 
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tggaattcca 
accaccctcc 
ccagggccca 
cacctgcccc 
aggagccttt 
ggagcttcct 
tctctcctca 
tgggggcagc 
ttctctagtc 
gtatcagcct 
cacctgtcta 
ccccccaggg 
tctggcaagg 
cgccccctca 
tctccagggt 
tcttcccttg 
agttggggaa 
ccccctagga 
gaatggctgg 
tggaactcta 
aaaggagaag 
tgacctagag 
aagtatatat 
tccattccct 
agatgttaat 
agggcagtgg 
acccaagtat 
atcttctacc 
agcagctgcc 



aagcacaaaa 
tgccttccag 
aagaaatggg 
ttgcctcctt 
gggctgagca 
tgagcagtgg 
tttcggtgca 
taacgctgag 
aatgaggggc 
ctcttactgt 
ggcaagctgg 
ccaccgcttc 
aaccttgctt 
ccaaggctgg 
gttttccact 
cctgtgggca 
agaggataat 
tccaggactg 
aggtgggaga 
tcctgttagg 
ggagggtcta 
taaatggaga 
atacatatat 
ctgccctgaa 
taaatgattg 
tctgggccca 
gggcctcttc 
ccccaccccc 
tcacacttgt 



ggtgcagagg 
ttctgccagg 
tgagcagtag 
gaccacactg 
agcactgagg 
gcccaggcct 
tgtcctttct 
tgacaaggat 
acccagtgct 
actctccggg 
cttccccatt 
tttcttgatc 
ttagcttcac 
gaacagaggg 
agtcactact 
gtggagagtg 
cagtgagcac 
ggtcaaagct 
gaacctgact 
atcttctgag 
gaagaggcag 
gaccaaaagc 
atatttcttt 
gcctgagtga 
aaacttgaaa 
catttagaag 
tgcctagtac 
cattacaggg 
gttctctcct 



ggarcggcct- 
tgctccatg- 
agtcatctcg 
gggtgggtgg 
aggtggatgg 
ggccctccac 
gcagctgcct 
gggaagccac 
tctagggcag 
aatgttaacc 
ggcccctgtg 
ctctttcctt 
caccaaggag 
gatgtggtga 
gtcttctcct 
ctgctgggtg 
tgttctgctc 
gcatgaaacc 
tctctttccc 
cttgtttccc 
cccttctttg 
ctctgatttt 
aaatttttga 
gacacatgaa 
aaggctactg 
ggaaaatgtt 
tgccaaaggg 
taaagggaac 
gagatggtcc 



tcctgtgcct 
"tg^Qgacaag 
gggcacttgg 
gcccccagca 
aagggagcat 
acttcattct 
ttcagcacag 
aggtgcattt 
gctgggtggt 
tttctatttt 
ggtccacagc 
aacagtgact 
agaggttgac 
gagccaggtt 
tgtagctaat 
tacgctgcac 
agagctcctg 
aggccctggc 
tctccctcct 
tgctgggtgg 
tcctctgggg 
taatttccat 
gtctttgata 
gaaaactgtg 
cttcttaatg 
ttgcctgctg 
actgttaagg 
cccagactag 
agctt 



caact caeca 900 
taggagactg 960 
cagtgtcaagl020 
cttcagaggcl080 
ctggagggggl 1 4 0 
ctgacctttcl200 
gtggttccacl260 
tactcaagtcl320 
ggtcccctagl380 
cagect gtgcl 440 
agcgtggctglSOO 
tgggctt gaglS 60 
atgacctcccl620 
cctctggcccl680 
caatcaatatl740 
ctgcccactgl800 
atctaccccal860 
agcaacctggl920 
ccaacattacl980 
gacagaggac2040 
taaatgagct2100 
aaaatgttag2160 
tgtctaaaaa2220 
tttcatttaa2280 
ttggggggac2340 
cacacattgg2400 
tgtcttgtcc2460 
gtgaggggcc2520 
2565 



(2) INFORMATION ClBER SEQ ID NO: 21: 



(i) SEQUENZ CHARAKTERIST1K: 

(A) LANGE: 461 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblieaing und Editierung 

hergestellte partielle cDNA 



(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 21 



gttcctccct agcggctcgc tcagaagcag agttctgggg tgtctccacc atggcctgga 60 
cccctctctg gctcactctc ctcactcttt gcataggttc tgtggtttct tctgagctgal20 
ctcaggaccc tgctgtgtct gtggccttgg gacagacagt caggatcaca tgccaaggagl80 
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acagcctcag aagctattat gcaagctggt accagcagaa gccaggacag gcccctgtac240 

ttgtcatcta tggtaaaaac aaccggccct cagggatcco . qgaccgattzc tctggctcca300 

gctcaggaaa cacagcttcc ttgaccatca ctggggctca ggcggaagat gaggctgact360 

attacttgta aactccccgg gacagcaagt gggtaaccaa tgtgggtatt ccggcgggag420 

ggacccaagc ttgacccgtt cttaaggtca gcccaaaggg. c 461 

(2) INFORMATION OBERSEQ ID NO: 22: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 2096 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(II) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 
hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 

(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 22 

cagctggttg gacgagggga gctggtgggc 60 
ggtgtagctg gcagcagcgg gcaggtgagg 120 
cctccccagg cctcgcttcc ccagccttct 180 
ggcggggaga agcagacacc ctcccagcag 240 
aaggtgacat cttctcgggg ggagccgaga 300 
tccaggcaga aagagcaaga gggcagggag 360 
gcacgtgggg tggtacccca ggcctgggtc 420 
gaggggtccc cagctgccac ctcacccacc 480 
gctccggcac ggctttctca ggcctatgcc 540 
aggcagtgct cggggagttg cagcaggacg 600 
gggccccggg acattgcgca caaaggaggc 660 
cacgaggcac tgcggcccag ggtctggtgc 720 
tgtatgccct caccgctcag cccctggggc 780 
agtcaagggc atggggcgag accagactag 840 
ctcaggaggg aggatggtgc aggcaggggt 900 
actggcctcc agagcccgtg gccaaggcgg 960 
gtgcctcagc gttcgggctg gagacgaggcl020 
cagctgccga aggcaagacg ccaggtccggl080 
cggtgtgacg gcgaggacag aggaggcgcgll40 
gggctgcggc aagaagcggg tctgtttcttl200 
ctgccctctg ggctcccaga acccacaacal260 
tgggcctttg aatccggaca caaaaccctcl320 
aaccactgca ctacctgact caggaatcggl380 
ctcatcagcc caacatcaaa gacaccatcgl440 
accggcgact ccatcttcat ggccaccccclSOO 
atcatcccag agctgagctc ctccagcgggl560 
tctttttcat ccttctgtct ctttgtttctl620 
gattcaaagc ctccacgact ctgtttccccl680 
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atcccggtca cttttggtta caggacgtgg 
agggtttgat cccagggcct gggcaacgga 
accccatctg • ccgggcaggt gagtcccttc 
gaaagaagga ggtttagggg atcgagggct 
aggggcagga tgggggcagg agagttagca 
ctgcgcaagg ctggggggtt atgggcccgt 
ggagcacagg ggtggccagc gtagggtcca 
agacagggac atggcagggg acacaggaca 
gcaattcatt tagtagcagg cacaggggca 
ggagcctcga gggctggaga gcgggaagac 
tcaccaggag ggcgaacggc cacgggaggg 
tgcaggggct cggcctgcgg gcgccgg.tcc 
ggagagggcc cacagtggac ttggtgacgc 
tggcttggca gacagtacag catccagggg 
gcgaggcggg cggggcgga^g tgaatgagct 
gaggagcgca gggggcggcg agcgggaggc 
gcctcgcggg cggcgacgga- gccgggatcg 
caggtctcca gctggggtgg acgtgcccac 
tggacgtgac aagcaggaca tgacatggtc 
tccggccttc ctgaacacct taggctggtg 
tacttcctcc acggagtcgg cacactatgg 
tgaaagaaat ggtgctaccc agctcaagcc 
tagcttggaa atgaatatgc tgcactttac 
ctctggaagg tgaagctaga ggaaccagac 
gaacagcagc gcccgcagca cccaccccgc 
tgcggcggac ggttgaccac cagccaccac 
atgacgccgt ccccaccacc tccctcttct 
gagctttcct gtctttcctt ttttctgaga 
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cgtcccttct gaatttaatt tgcactaagt 
ccttgagtct cagtacgagt gtgcgtgagt 
gcccggccgc cctccatctg ggccgggtga 
ccctgccctc gcctagtctg gaagctccga 
cattacgccc catctcgctc tgtgcccctc 
tcatcatcaa taaacactgt tacagcaaaa 
aaagaaaaag aaaaggaaaa aaagagaggg 



catctgcact ggttggagtt gtggag^cqc4*1740 
gtg^gccacc ttggsaagtg cctgtgcaggI300 
ctgggcgccg gctgtgtgcc cgaggcctcal860 
ccgacatcac ggagcagcct tcaagcattcl920 
cccaccaggg cttcagcagg agccctggacl980 
aaaaaaaaga aaaaaaaaag aaaagaaaaa2040 
aggaagaagg .agaaaaggga gtgtgg 2096 



(2) INFORMATION OBER SEQ ID NO: 23: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1 348 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblienjng und Editierung 

hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 

(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 23 

ctctcgcgag gccccagaga gcaggcgctg ggcagtgtgg aggtcgttgg agtcacttcc 60 
gcgtcaccag ctcctgtgcc tgccagtcgg tgcccctccc gctccagcca . tgctctccgc 120 
cctcgcccgg cctgccagcg ctgctctccg ccgcagcttc agcacctcgg cccagaacaa 180 
tgctaaagta gctgtgctag gggcctctgg aggcatcggg cagccacttt cacttctcct 240 
gaagaacagc cccttggtga gccgcctgac cctctatgat atcgcgcaca cacccggagt 300 
ggccgcagat ctgagccaca tcgagaccaa agccgctgtg. aaaggctacc tcggacctga 360 
acagctgcct gactgcctga aaggttgtga tgtggtagtt attccggctg gagtccccag 420 
aaagccaggc atgacccggg acgacctgtt caacaccaat gccacgattg tggccaccct 480 
gaccgctgcc tgtgcccagc actgcccgga agccatgatc tgcgtcattg ccaatccggt 540 
taattccacc atccccatca cagcagaagt tttcaagaag catggagtgt acaaccccaa 600 
caaaatcttc ggcgtgacga ccctggacat cqtcagagcc aacacctttg ttgcagagct 660 
gaagggtttg gatccagctc gagtcaacgt ccctgtcatt ggtggccatg ctgggaagac 720 
catcatcccc ctgatctctc? agtgcacccc caaggtggac tttccccagg accagctgac 780 
agcactcact gggcggatcc aggaggccgg cacggaggtg gtcaaggcta aagccggagc 840 
aggctctgcc accctctcca tggcgtatgc cggcgcccgc tttgtcttct cccttgtgga 900 
tgcaatgaat ggaaaggaag gtgttgtgga atgttccttc gttaagtcac aggaaacgga 960 
atgtacctac ttctccacac cgctgctgct tgggaaaaag ggcatcgaga agaacctgggl020 
catcggcaaa gtctcctctt ttgaggagaa gatgatctcg gatgccatcc ccgagctgaal080 
ggcctccatc aagaaggggg aagatttcgt gaagaccctg aagtgagccg ctgtgacgggll40 
tggccagttt ccttaattta tgaaggcatc atgtcactgc aaagccgttg cagataaactl200 
ttgtatttta atttgctttg gtgatgatta ctgtattgac atcatcatgc cttccaaattl260 
gtgggtggct ctgtgggcgc atcaataaaa gccgtccttg attttaaaaa aaaaaaaaaal320 
aaaaaaaaaa aaaaaaaaaa aaaaaaaa 1348 
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(2) INFORMATION UBER SEQ ID NO: 24 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 358 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 24 



aatgggggac cggagaagaa 
ttcttcctct acagcaggct 
tcatgctccg tgatgcatga 
tctccgggta aatgagtgcg 
acgaggatgc ttggcacgta 
agcacccagc gctgccctgg 



gtacaagagc acgggtcagg 
aaccgtggac aagagcaggt 
ggctctgcac aaccactaca 
acggccggca agcccccgct 
ccccgtgtac atacttccca 
ggcccctgcg aaaaaaaaga 



ggcgggactc cgacggctcc 60 
ggcaggaggg gaatgtcttcl20 
cgcagaagag cctctccctgl80 
ccccgggctc tcgcggtcgc240 
ggcacccagc atggaaataa300 
aaaagaatcg aaaagggg 358 



(2) INFORMATION UBER SEQ ID NO: 25: 



(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 89 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 

(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 



(vii) SONSTIGE HERKUNFT: 
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(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 25 

gcccctagcc cctggcagac atagctgctt cagtgcccct tttcctctgc tggctagatg60 
gatgttgatg cactggaggt acttttagc 89 

(2) INFORMATION UBER SEQ ID NO: 26: 

(i) SEQUENZ CHARAKTERISTIK: 
. (A) LANGE: 1 632 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 26 

gacactggtt ggttctgata agaggcaggg gaggagaaag ccgaggaaga gggagttgcg 60 
gaagaggagg gagttaacaa gttctcttat ccaccatcac accgggagtg. ttgtccagcc 120 
gtggaggagg aggacgatga agaagctgta aagaaagaag ctcacagaac ctctacttct 180 
gccttgrtctc caggatccaa gcccagcact tgggtgtctt gcccagggga ggaagagaat 240 
caagccacgg aggataaaag aacagaaaga agtaaaggag ccaggaagac ctccgtgtcc 300 
ccccgatctt caggctccga ccccaggtcc tgggagtatc gttcaggaga ggcgtccgag 360 
gagaaggagg aaaaggcaca caaagaaact gggaaaggag aagctgcccc agggccgcaa 420 
tcctcagccc cagcccagag gccccagctc aagtcctggt ggtgccaacc cagtgatgaa 480 
gaggagggtg aggtcaaggc tttgggggca gctgagaagg atggagaagc tgagtgtcct 540 
ccctgcatcc ccccaccaag tgccttcctg aaggcctggg tgtattggcc aggagaggac 600 
acagaggaag aggaagatga ggaagaagat gaggacagtg actctggatc agatgaggaa 660 
gagggagaag ctgaggcttc ctcttccact cctgctacag gtgtcttctt gaagtcctgg 720 
gtctatcagc caggagagga cacacagtga tacaggatca gccgaggatg aaagagaagc 780 
tgagacttct gcttccacac cccctgcaag tgctttcttg aaggcctggg tgtatcggcc 840 
aggagaggac actggatagt gaggataagg aagatgattc agaagcagcc ttaggagaag 900 
ctgagtcaga cccacatccc tcccacccgg accagagggc ccacttcagg ggctggggat 960 
atcgacctgg aaaagagaca gaggaagagg aagctgctga ggactgggga gaagctgagcl020 
cctgcccctt ccgagtggcc atctatgtac ctggagagaa gccaccgcct ccctgggctcl080 
ctcctaggct gcccctccga ctgcaaaggc ggctcaagcg cccagaaacc cctactcatgll40 
atccggaccc tgagactccc ctaaaggcca gaaaggtgcg cttctccgag aaggtcactgl200 
tccatttcct ggctgtctgg gcagggccgg cccaggccgc ccgccagggc ccctgggagcl260 
agcttgctcg ggatcgcagc cgcttcgcac gccgcatcac ccaggcccag gaggagctgal320 
gcccctgcct cacccctgct gcccgggcca gagcctgggc acgcctcagg aacccaccttl380 
tagcccccat ccctgccctc acccagacct tgccttcctc ctctgtccct tcgtccccagl440 
tccagaccac gcccttgagc caagctgtgg ccacaccttc ccgctcgtct gctgctgcaglSOO 
cggctgccct ggacctcagt gggaggcgtg gctgagacca actggtttgc ctataatttal560 
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ttaactattt attttttcta agtgtgggtt tatetaagga ataaagcctt ttgatttgtal6Z0 
acgaaaaaaa aa 1632 



(2) INFORMATION OBER SEQ ID NO: 27: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 2972 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editlerung 

hergesteltte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 27 

ccaggacgag cacctcatta cattcttcgt gcctgtcttt gagccgctgc cccctcagta 60 
cttcatccga gtggtgtctg accgctggct ctcttgtgag acccagctgc ctgtctcctt 120 
ccggcacctg atcttgccgg agaagtaccc ccctccaacc gaacttttgg acctgfcagcc 180 
cttgcccgtg tctgctctga gaaacagtgc ctttgagagt ctttaccaag ataaatttcc 240 
tttcttcaat cccatccaga cccaggtgtt taacactgta tacaacagtg acgacaacgt 300 
gtttgtgggg gcccccacgg gcagcgggaa gactatttgt gcagagtttg ccatcctgcg 360 
aatgctgctg cagagctcgg aggggcgctg tgtgtacatc acccccatgg aggccctggc 420 
agagcaggta tacatggact ggtacgagaa gttccaggac aggctcaaca agaaggtggt 480 
actcctgaca ggcgagacca gcacagacct gaagctgctg ggcaaaggga acattatcat 540 
cagcacccct gagaagtggg acatactttc ccggcgatgg aagcagcgca agaacgtgca 600 
gaacatcaac ctcttcgtgg tggatgaggt ccaccttatc gggggcgaga atgggcctgt 660 
cttagaagtg atctgctccc gaatgcgcta catctcctcc cagattgagc ggcccattcg 720 
cattgtggca ctcagctctt cgctctccaa tgccaaggat gtggcccact ggctgggctg 780 
cagtgccacc tccaccttca acttccatcc caatgtgcgt cccgtcccct tggagctgca 840 
catccagggc ttcaacatca gccataeaca aacccgcctg ctctccatgg ccaagcctgt 900 
gtaccatgct atcaccaagc actcgcccaa gaagcctgtc attgtctttg tgccgtctcg 960 
caagcagacc cgcctcactrg ccattgacat cctcaccacc tgtgcagcag acatccaacgl020 
gcagaggttc ttgcactgca ccgagaagga tctgattccg tacctggaga agctaagtgal080 
cagcacgctc aaggaaacgc tgctaaatgg ggtgggctac ctgcatgagg ggctcagcccll40 
catggagcga cgcctggtgg agcagctctt cagctcaggg gctatccagg tggtggtggcl200 
ttctcggagt ctctgctggg gcatgaacgt ggctgcccac ctggtaatca tcatggatacl260 
ccagtacfac aatggcaaga tccacgccta tgtggattac cccatctatg acgt get tea 1320 
gatggtgggc cacgccaacc gccctttgca ggacgatgag gggcgctgtg tcatcatgtgl380 
tcagggctcc aagaaggatt tcttcaagaa gttcttatat gagecattge cagtagaatcl440 
tcacctggac cactgtatgc atgaccactt caatgetgag atcgtcacca agaccattga 150.0 
gaacaagcag gatgctgtgg actacctcac ctggaccttt ctgtaccgcc gcatgacacal560 
gaaccceaat tactacaacc tgcagggcat ctcccatcgt cacttgtegg accacttgtcl620 
agagctggtg gagcagaccc tgagtgacct ggagcagtcc aagtgcatca gcatcgaggal680 
cgagatggac gtggcgcctc tgaacctagg catgatcgcc gectactatt acatcaactal740 
caccaccatt gagctcttca gcatgtccct caatgecaag accaaggtgc gagggcttat!800 



81 



DE 198 18. 619 A 1 



cgagatcatc tccaatgcag cagagtatga gaacattccc atccggcacc atgaagaCaal860 
tctcctgagg cagttggctc agaaggtccc ccacacgctg aataacccta agttcaaLgal320 
tccgcacgtc aagaccaacc tgctcctgca ggctcacttg tctcgcatgc agctgagtgcl980 
tgagttgcag tcagatacgg aggaaatcct tagtaaggca, atccggctca tccaggcctg204O 
cgtggatgtc ctttccagca atgggtggct cagccctgct ctggcagcta tggaactggc2l00 
ccagatggtc acccaagcca tgtggtccaa ggactcatac ctgaagcagc tgccacactt2160 
cacctctgag catatcaaac gttgcacaga caagggagtg gagagtgttt tcgacatcat2220 
ggagatggag gatgaagaac ggaacgcgtt gcttcagctg actgacagcc agattgcaga2280 
tgtggctcgc ttttgtaacc gctaccctaa tatcgaacta tcttatgagg tggtagataa2340 
ggacagcatc cgcagtggcg ggccagttgt ggtgctggtg cagctggagc gagaggagga2400 
agtcacaggc cctgtcattg cgcctctctt cccgcagaaa cgtgaagagg gctggtgggt2460 
ggtgattgga gatgccaagt ccaatagcct catctccatc aagaggctga ccttgcagca2520 
gaaggccaag gtgaagttgg actttgtggc cccagccact ggtgcccaca actacactct2580 
gtacttcatg agtgacgctt acatgggatg tgaccaggag tacaaattca gcgtggatgt2640 
gaaagaagct gagacagaca gtgattcaga ttgagtcctg aggcatttac ttttgggtaa2700 
aggagagttg agcctgaatt aggaatgtgt acattgtagg aatcctggtt gtggggacca2760 
ggtctgtggg cctcaggtct ggccagccag ggctggtgct gtccccgcct acctccactt2820 
cctttccctt gctcactctg gatccagtga cagcaggtgt catgggtcaa gcataaatca2880 
tatatagcat tttcaggcat gttcctggta gttcttttga gtctgacatt ctaataaaat2940 
aatttgtaga aaaaaaacca aaaaaaaaaa aa 2972 

(2) INFORMATION UBER SEQ ID NO: 28: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 496 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 28 

ctcgagccga agagtcctgg acctcctgtg caiagaacatg aaacatctgt ggttcttcct 60 
tctcctggtg gcagctccca* gatgggtcct gtcccaggtg cagctgcagg agtcgggcccl20 
aggactggtg aagccttcgg agaccctgtc cctcacctgc actgtctctg gtggctccatl80 
cagtagttac tactggagct ggatccggca gcccccaggg aagggactgg agtggattgg240 
gtatatctat tacagtggga gcaccaacta caacccctcc ctcaagagtc gagtcaccat300 
atcagtagac acgtccaaga accagttctc cctgaagctg agctctgtga ccgctgcgga360 
cacggccgtg tattactgtg cgagacaggg tatagcagtg gaccagcttg actactgggg420 
ccagggaacc ctggtcaccg tctcctgagc ctgcaccaag gggccatcgg tcttccccct480 
ggcaccctgc tccaag 496 



(2) INFORMATION UBER SEQ ID NO: 29: 
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(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 397 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 

(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vli) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 29 

gaggtcctgg acctcctgtg caagaacatg aaacacctgt ggttcttcct cctcctggtg 60 
gcagctccca gatgggtcct gtcccaggtg- cagctgcagg agtcgggccc aggactggtgl20 
aagcctt'cgg agaccctgtc cctcacctgc actgtctctg gtggctccat cagtagttacl80 
tactggagct ggatccggca gcccgccggg aagggactgg agtggattgg gcgtatctat240 
accagtggga gcaccaacta caacccctcc ctcaagagtc gagtcaccat gtcagtagac300 
acgtccaaga accagttctc cctgaagctg agctctgtga ccgccgcgga cacggccgtg360 
tattactgtg cgagagcaaa acgcagctgg acctcag 397 

(2) INFORMATION UBER SEQ ID NO: 30: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 772 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 30" 

gggactcctc agttcacctt ctcacaatga ggctccctgc tcagctcctg gggctgctaa 60 
tgctctgggt ctctggatcc agtggggata ttgtgatgac tcagtctcca ctctccctgcl20 
ccgtcacccc tggagagccg gcctccatct cctgcaggtc tagtcagagc ctcctgcatal80 
gtaatggata caactatttg gattggtacc tgcagaagcc agggcagtct ccacagctcc240 
tgatctattt gggttctaat cgggcctccg gggtccctga caggttcagt ggcagtggat300 
caggcacaga ttttacactg aaaatcagca gagtggaggc tgaggatgtt ggggtttatt360 
actgcatgca agctctacaa actcctctca ctttcggcgg agggaccaag gtggagatca420 
aacgaactgt ggctgcacca tctgtcttca tcttcccgcc atctgatgag cagttgaaat480 
ctggaactgc ctctgttgtg tgectgctga ataacttcta tcccagagag gccaaagtac540 
agtggaaggt ggataacgcc ctccaatcgg gtaactccca ggagagtgtc acagagcagg600 
acagcaagga cagcacctac agcctcagca gcaccctgac gctgagcaaa gcagactacg660 
agaaacacaa agtctacgcc tgcgaagtca cccatcaggg cctgagcttg gcccgttaag720 
aaagggcttt caacaggggg aagtttttag aggggagatg tggccccacc tt 772 

(2) INFORMATION UBER SEQ ID NO: 31: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1031 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 31 

ggggacggaa gcgcagagca cggaccccgc cccctcgcgg ccccgctcgt gacgtcgcgg 60 
ggggcgccgg cctccgcccg gccccgaggg gqtctccccg gaggctcagc cccctctgct 120 
ccccatgggc aactgccagg cagggcacaa cctgcacctg tgtctggccc accacccacc 180 
tctggtctgt gcQactttgaf tcctgctgct ccttggcctc tctggcctgg gccttggcag 240 
cttcctcctc acccacagga ctggcctgcg cagcctgaca tcccccagga ctgggtctct 300 
tttttgagat cttttggcca gctgaccctg tgtcccagga atgggacagt cacagggaag 360 
tggcgagg'gt ctcacgtcgt gggcttgctg accaccttga acttcggaga cggtccagac 420 
aggaacaaga cccggacatt ccaggccaca gtcctgggaa gtcagatggg attgaaagga 480 
tcttctgcag gacaactggt ccttatcaca gccagggtga ccacagaaag gactgcagga 540 
acctgcctat attttagtgc tgttccagga atcctaccct ccagccagcc acccatatcc 600 
tgctcagagg agggggctgg aaatgccacc ctgagcccta gaatgggtga ggaatgtgtt 660 
agtgtctgga gccatgaagg ccttgtgctg accaagctgc tcacctcgga ggagctggct 720 
ctgtgtggct ccaggctgct ggtcttgggc tccttcctgc ttctcttctg tggccttctc 780 
. tgctgtgtca ctgctatgtg cttccacccg cgccgggagt cccactggtc tagaacccgg 840 
ctctgagggc actggcctag ttcccgactt gtttctcagg tgtgaatcaa cttcttgggc 900 
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cttggctctg agttggaaaa ggttttagaa aaagtgaaga . gctogaatgt gggggaaaat 960 
aaaaagcttt tttgcccaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaal020 
aaaaaaaaaa a 1031 



(2) INFORMATION OBER SEQ ID NO: 32: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 739 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBUOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 32 



cggctcgagc cccgctcagt 
ccggggaagg tcagcgccgt 
caaaaggtgt tgcggcttta 
agagacaaat accgatactt 
gaaaaggata tggcgaaggc 
cgtcagcatc cacagccata 
tacgattgct acaaggtccc 
atgtatcctg attactttgc 
gaacgagagg ttaagcagct 
ttgccccctg cccgaaagga 
ccccgggagc ggcccatgta 
atgttacaga acatgcactt 
aaaaaaaaaa aaaaaaaaa 



cacccgcagc aggcgtgcag 
aatggcgttc ttggcgtcgg 
taagcgggcg ctacgccacc 
tgcttgtttg atgagagccc 
cacccagctg ctgaaggagg 
catcttccct gactctcctg 
agaatggtgc ttagatgact 
caagagagaa cagtggaaga 
gcaggaggaa acgccacctg 
aggtgatttg cccccactgt 
gaaagagaga gacctcatct 
gccctaataa aaaatcagtg 



tttcccggct ctccgcgcgg 60 
gaccctacct gacccatcagl20 
tcgagtcgtg gtgcgtccagl80 
ggtttgaaga acataagaat240 
ccgaggaaga attctggtac300 
ggggcacctc ctatgagaga360 
ggcatccttc tgagaaggca420 
aactgcggag ggaaagctgg480 
gtggtccttt aactgaagct540 
ggtggtatat tgtgaccaga600 
ttcatgcttg caagtgaaat660 
aaatggaaaa aaaaaaaaaa720 

739 



(2) INFORMATION OBER SEQ ID NO: 33: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 651 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblieaing und Editierung 

hergestellte partielle cDNA 
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(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 33 

cggctcgagc ctcagttcac cttctcacca tgaggctccc tgctcagctc ctggggctgc 60 
taatgctctg ggtccctgga tccagtgagg atattgtgat gacccagact ccactctcccl20 
tgcccgtcac ccctggagag ccggcctcca tctcctgcag gtctagtcag agcctcttggl80 
atagtgatga tggaaacacc tatttggact ggtacctgca gaagccaggg cagtctccac240 
agctcctgat ctatacgctt tcctatcggg cctctggagt cccagacagg ttcagtggca300 
gtgggtcagg cactgatttc acactgaaaa tcagcagggt ggaggctgag gatgttggag360 
tttattactg catgcaacgt atagaatttc cttacacttt tggccagggg accaagctgg420 
agatcaaacg aactgtggct gcaccatctg tcttcatctt cccgccatct ggatgagcag480 
ttgaaatctg gaacttgcct ctgttgttgt gcctgcttga ataactttct attcccagag540 
aggggcaaag taacagtgga aggttggatt aacgccctgc aattcgggta actgcccagg600 
gagtagtttt cacagggcag gggcagcaag gacagcacct acagtcttag t 651 

(2) INFORMATION ClBER SEQ ID NO: 34: 

(i) SEQUENZ CHARAKTER1STIK: 

(A) LANGE: 823 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: elnzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOlTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 34 

ccgcgtcgac aaattttttt aaagatcatc gatgaagaga gaaaatgcgc ttttctacag 60 
agtccccttc ccacccacag ccccatcccc agataagcgg ggagttccct ggcgcggtgcl20 
cagtttctag ccgctgagtg ggcgtgtgcg cggctccaag tgcgcctgcg tactgctcacl80 
tccccagctc cgcgccctgc tccgttcctc ccaaaactct gaatcgaaga actttccgga240 
agtttctgag agcccagacc ggcgggcacg cgcccatccc caaccccctc tgttaatccc300 
taccagcctg cagtcctggc tgcttccaag caggaggtgg ggcctctggc ctagcggggc360 
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cgaaaggcag tgcccctccc ccgcagtctg atttcc^tct- tccccccaac ggcaagcacg!20 

aggagcggca ggacgagcat ggctacatct cccggtgctt cacgcggaaa tacacgctgc480 

cccccggtgt ggaccccacc caagtttcct cctccctgtc ccctgagggc acactgaccg540 

tggaggcccc catgcccaag ctagccacgc agtccaacga gatcaccatc ccagtcacct600 

tcgagtcgcg ggcccagctt gggggcccag aagctgcaaa atccgatgag actgccgcca660 

agtaaagcct tagqccggat. gcccacccct gctgccgcca ctggctgtgc ctcccccgcc720 

acctgtgtgt tcttttgata catttatctt ctgtttttct caaataaagt tcaaagcaac780 

cacctggtca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 823 

(2) INFORMATION OBER SEQ ID NO: 35: 

(i) SEQUENZ CHARACTERISTIC 

(A) LANGE: 457 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Edib'erung 

hergesteltte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 35 

cataagggaa atgctttctg agagtcatgg atctcatgtg caagaaaatg aagcacctgt 60 

ggttcttcct cctgctggtg gcggctccca gatgggtcct gtcccagctg cagctgcaggl20 

agtcgggccc aggactggtg aagccttcgg agaccctgtc cctcacctgc actgtctctglBO 

gtggctccat cagcagtagt agttactact ggggctggat ccgccagccc ccagggaagg240 

ggctggagtg gattgggagt atctattata gtgggagcac ctactacaac ccgtccctca300 

agagtcgagt caccatatcc gtagacacgt ccaagaagta cttctccctg aagctgagct360 

ctgtgaccgc cgcagacacg gctgtgtatt actgtgcgag acatgactgg tattacgata420 

ttttgactgg ttatgcgaaa cccggcacag gttcgac 457 

(2) INFORMATION UBER SEQ ID NO: 36: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1203 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Edftierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
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(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBUOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 36 

gtcgggggcg cctgcgcagt. cgctcttcct 
ccggtgcagc gggagattca ccaggactgg 
agcagcatca agaaaatcgc agactttctc 
cttgcaacac taaacgagaa attgacagcc 
cgggtgacaa aaggtgagac. actcacctag 
tttacacaac acaggccaca tgggaaaggc 
ttaaagagca acagggctta ttcttgtttt 
tgccatctgg ggtgtggtgt ggtatgtggg 
gttaggcatt ttaccttttc agtaacattt 
attcacagcc aaaagcctgg gactctttgt 
tctctctctc aaactttcct taaagttctc 
tttgtctcct ccccaccttt ggtgggaagt 
cctggcaatg .tggctgccag agaatgttgt 
agaggtcaag gccaggcccc cacttggctt 
cacttggagt gtctatgcct ctcatatttc 
tgctgtgatt atggtgggga gaggagctag 
atacatggcg ttattattct aacataaaac 
aggtgcttac cagcccaagt ccccatgttt 
atcatttggt cattcagcac atttaccaag 
taaaacatac agcattgaat cagaaaaaaa 
aaa 



(2) INFORMATION UBER SEQ ID NO: 37: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 207 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partlelle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 



caggcggcgg ccatggcggg acaggaggat 60 
gctaaccggg agtacattga gataatcacc 120 
aactcgttcg atatgtcttg tcgttcaaga 180 
cttgaacgga gaatagagta cattgaagct 240 
aacagtgccg tgctgctgct gggaagttgc 300 
cccagcagcc ttcagctcct tcctttctcc 360 
tcttttttca aaagtgtggc ctttgggctc 420 
aagaagttca gaggaaccgt tggaaacgac 480 
tatacatcta cttgtcaatg tatttgagac 540 
gaaggtcctc ctcacctcta tctttctttc 600 
attgcctttg cactgcttct gtgaacagtc 660 
gcggggcagt cctggtcaag acactcatgc 720 
tgctaaccca ccagtttctt gttgatttgg 780 
gaagggacat tttcagactt ttctttctgt 840 
cctaataaac tcctcaactt tttatctgac 900 
agatgggttc acttattgca cagaaatgta 960 
tttcagatgt agctgtttga ttcaaagcctl020 
ggactttcag ctgactagct catcttgggal080 
tatttactat gtaggcatgt taaactccaall40 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa!200 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 37 



cggctcgagg ccgccctcgg tgtcagtgtc cccaggacag acggccagga tcacctgctc 60 

tggacatgca ttgccaaagc aatgatgctt attggtacca gtcagaggcc agggccaggcl20 

ccctgtgctt ggtggatccc ttgaaagaac attggaggag ggcccttcag ggcatgccctl80 

ggagacggat tgctctgggc ttccaac 207 

(2) INFORMATION OBER SEQ ID NO: 38: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 346 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus elnzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 38 

cggctcgagc ggctcgaggc cgggatggtg ggtgctacgc cccttgggta ctggggccag 60 
ggaaccctgg tcaccgtctc ctcagcctcc accaagggcc catcgggctt cccccgggcal20 
ccctcctcca agagcacctc tgggggcaca gcggccctgg gctgcctggt caaggactacl80 
ttccccgaac cggtgacggt gtcgtggaac tcaggcgcct gaccagcggc gtgcacacct240 
tcccggctgt ctacagtctc aggactctac tcctcagcag cgtggtgacg tgccctccag300 
cagttgggca .ccagacctac atctgcaagt gaatcgaagc cagcaa 346 



(2) INFORMATION UBER SEQ ID NO: 39: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 926 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 



89 



DE 198 18 619 Al 



(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xl) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 39 

cggctctaag gaagcagcac tggtggtgcc tcagccatgg cctggaccgt tctcctcctc 60 
ggcctcctct ctcactgcac aggctctgtg acctcctatg tgctgactca gccaccctcgl20 
gtgtcagtgg ccccaggaca gacggccagg attacctgtg ggggaaacaa cattggaagtlSO 
aaaagtgtgc actggtacca gcagaagcca ggccaggccc ctgtgctggt cgtctatgat240 
gatagcgacc ggccctcagg gatccctgag cgattctctg gctccaactc tgggaacacg300 
gccaccctga ccatcagcag ggtcgaagcc ggggatgagg ccgactatta ctgtcaggtg360 
tgggatagta gtagtgatca ttgggtgttc ggcggaggga ccaagctgac cgtcctaggt420 
cagcccaagg. ctgccccctc ggtcactctg ttcccgccct cctctgagga gcttcaagcc480 
aacaaggcca cactggtgtg tctcataagt gacttctacc cgggagccgt gacagtggcc540 
tggaaggcag atagcagccc cgtcaaggcg ggagtggaga ccaccacacc ctccaaacaa600 
agcaacaaca agtacgcggc cagcagctat ctgagcctga cgcctgagca gtggaagtcc660 
cacagaagct acagctgcca ggtcacgcat gaagggagca ccgtggagaa gacagtggcc720 
cctacagaat gttcataggt tctcaaccct caccccccac cacgggagac tagagctgca780 
ggatcccagg ggaggggtct ctcctcqcac cccaaggcat caagcccttc tccctgcact840 
caataaaccc tcaataaata ttctcattgt caatcaggaa aaaaaaaaaa aaaaaaaaaa90O 
aaaaaaaaaa aaaaaaaaaa aaaaaa 926 

(2) INFORMATION UBER SEQ ID NO: 40: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 2384 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MDLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte parb'elle cDNA 

(III) HYPOTHETISCH: NEIN 

(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 40 

gcctcccgcc cgccgcctct gtctccctct ctccacaaac tgcccaggag tgagtagctg 60 
ctttcggtcc gccggacaca ccggacagat agacgtgcgg acggcccacc accccagccc 120 
gccaactagt cagcctgcgc ctggcgcctc ccctctccag gtccatccgc catgtggccc 180 
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ctgtggcgcc tcgtgtctct gctggccctg agccaggcoc tgcccttt^a gcagagaggc 210 
ttctgggact tcaccctgga cgatgggcca ttcatgELtga cegatga'gga 'agcttcgggc 300 
gctgacacct cgggcgtcct ggacccggac tctgtcacac ccacctacag cgccatgtgt 360 
cctttcggct gccactgcca cctgcgggtg gttcagtgct ccgacctggg tctgaagtct 420 
gtgcccaaag agatctcccc tgacaccacg ctgctggacc tgcagaacaa cgacatctcc 480 
gagctccgca aggatgactt caagggtctc cagcacctct acgccctcgt cctggtgaac 540 
aacaagatct ccaagatcca tgagaaggcc ttcagcccac tgcggaagct gcagaagctc 600 
tacatctcca agaaccacct ggtggagatc ccgcccaacc tacccagctc cctggtggag 660 
ctccgcatcc acgacaaccg catccgcaag gtgcccaagg gagtgttcag tgggctccgg 720 
aacatgaact gcatcgagat gggcgggaac ccactggaga acagtggctt tgaacctgga 780 
gccttcgatg gcctgaagct caactacctg cgcatctcag aggccaagct gactggcatc 840 
cccaaagacc tccctgagac cctgaatgaa ctccacctag accacaacaa aatccaggcc 900 
atcgaactgg aggacctgct tcgctactcc aagctgtaca ggctgggcct aggccacaac 960 
cagatcagga tgatpgagaa cgggagcctg agcttcctgc ccaccctccg ggagctccacl020 
ttggacaaca acaagttggc cagggtgccc tcagggctcc cagacctcaa gctcctccagl080 
gtggtctatc tgcactccaa caacatcacc aaagtgggtg tcaacgactt ctgtcccatgll40 
ggcttcgggg tgaagcgggc ctactacaac ggcatcagcc tcttcaacaa ccccgtgcccl200 
tactgggagg tgcagccggc cactttccgc tgcgtcactg acCgcctggc catccagtttl260 
ggcaactaca aaaagtagag gcagctgcag ccaccgcggg gcctcagtgg gggtctctggl320 
ggaacacagc cagacatcct gatggggagg cagagccagg aagctaagcc agggcccagcl380 
tgcgtccaac ccagcccccc acctcgggtc cctgacccca gctcgatgcc ccatcaccgcl440 
ctctccctgg ctcccaaggg tgcaggtggg cgcaaggccc ggcccccatc acatgttccclSOO 
ttggcctcag agctgcccct gctctcccac cacagccacc cagaggcacc ccatgaagctl560 
tttttctcgt tcactcccaa acccaagtgt ccaaggctcc agtcctagga gaacagtcccl620 
tgggtcagca gccaggaggc. ggtccataag aatggggaca gtgggctctg ccagggctgcl680 
cgcacctgtc cagacacaca tgttctgttc ctcctcctca tgcatttcca gcctttcaacl740 
cctccccgac tctgcggctc ccctcagccc ccttgcaagt tcatggcctg tccctcccagl800 
acccctgctc cactggccct tcgaccagtc ctcccttctg ttctctcttt ccccgtccttl860 
cctctctctc tctctctctc tttctgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtgl920 
tgtgtgtgtc ttgtgcttcc tcagaccttt ctcgcttctg agcttggtgg. cctgttccctl980 
ccatctctcc gaacctggct tcgcctgtcc' ctttcactcc acaccctctg gccttctgcc2040 
ttgagctggg actgctttct gtctgtccgg cctgcaccca gcccctgccc acaaaacccc2100 
agggacagcg gtctccccag cctgccctgc tcaggccttg cccccaaacc tgtactgtcc2160 
cggaggaggt tgggaggtgg aggcccagca tcccgcgcag atgacaccat caaccgccag2220 
agtcccagac accggttttc ctagaagccc ctcaccccca ctggcccact ggtggctagg22B0 
tctcccctta tccttctggt ccagcgcaag gaggggctgc ttctgaggtc ggtggctgtc2340 
tttccattaa agaaacaccg tgcaacgtga aaaaaaaaaa aaaa . 2384 



(2) INFORMATION UBER SEQ ID NO: 41: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 334 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYPi aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 

(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 
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(A) BIBLIOTHEK: cDNA library 
(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 41 



ctcgagccga attcggctcg agaggagccc agccctggga ttttcaggtg ttttcatttg 60 

gtgatcagga ctgaacagag agaactcacc atggagtttg ggctgagctg gctttttcttl20 

gtggctattt taaaaggtgt ccagtgtgag gtgcagctgt tggagtctgg gggaggcttgl80 

gtacagcctg gggggtccct gagactctcc tgtgcagcct ctggattcac ctttagcagc240 

tatgccatga gctgggtccg ccaggctcca gggaaggggc tggagtgggt ctcaggtatt300 

agtggtagtg gtgtgatagt acacactacg caga 334 

(2) INFORMATION UBER SEQ ID NO: 42: 

(i) SEQUENZ CHARAKJERISTIK: 

(A) LANGE: 845 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 42 

gcgttccctc cgccgagcta cttctttctt tccttttttt tttttttctg gctaacagaa 60 

ttttattgtt aaatcacaga aactttagtg caaaacaaaa atcacgaagt ccatttaatal20 

gcaacttcat gtcctgctgg ctttgcttgc tgtctcctgg caaccagaag tggacagaagl80 

cgtgggtgcc caagtgggcc acagacagct tccaaccccc acaccccagc atccaatcca240 

cacccagcag acccttcggc atgccgccct ctaccaggaa gccagaggcc taggagctcg300 

ccatccatat ttatttgaaa aggtcaaaag gagcatctat gagacaaggg aggggtgcag360 

gctgaagcag cgcctcaaca gccagggaca tgtaggcaac acgagcaggc acagcgcggc420 

caccactgtc cacacgctca cacaagccag gcccgcaggg ccttcggaga gctagcaggt480 

tacattcagg cagatggccc tcttcccacc caaacccaca gaaccccaaa caaggcatca540 

ccaggaaaga cacgggaaag ccaaatcaca gttgaaccag ggacagagaa cccttggccc600 

cactgatgtc ccaagccacc agcagctgct tccaaaatcc ctatgctatt acagtgggaa660 

ttacatcatt taaaaagcct gattattccc aggcttctaa tctttcatat aaaactgcct720 

ttgttttgct cctttgttca actcagaggc ccagcaaagc gggcagggtc cctgatcagg780 

gcaggagccc acctcagaag cccatgccgc accagtgccc aagcacatgt cagtgctcag840 

aacaa g 45 



(2) INFORMATION UBER SEQ ID NO: 43: 
(i) SEQUENZ CHARAKTERISTIK: 
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(A) LANGE: 2233 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: aus einzelnen ESTs durch Assemblierung und Editierung 
hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 

(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 43 

gaattcagaa gttaatgatg ttgggtaaga gaacaatggt aagagagcaa tctaagaata 60 
tatcacctac tfctaatttta tatgagagta catggaggta gctgtgatgt ggaaatgtag 120 
cactgctcct acccacgcag atttattcca gtgaaacaac aactggaact tcaagtaact 180 
cctcccagag tacttccaac tctgggttgg ccccaaatcc aactaatgcc accaccaagg 240 
cggctgqtgg tgccctgcag tcaacagcca gtctcttcgt ggtctcactc tctcttctgc 300 
atctctactc ttaagagact caggccaaga aacgtcttct aaatttcccc atcttctaaa 360 
cccaatccaa atggcgtctg gaagtccaat gtggcaagga aaaacaggtc ttcatcgaat 420 
ctactaattc cacacctttt attgacacag aaaatgttga gaatcccaaa tttgattgat 480 
ttgaagaaca tgtgagaggt ttgactagat gatggatgcc aatattaaat ctgctggagt 540 
ttcatgtaca agatgaagga gaggcaacat ccaaaatagt taagacatga tttccttgaa 600 
tgtggcttga gaaatatgga cacttaatac taccttgaaa ataagaatag aaataaagga 660 
tgggattgtg gaatggagat tcagttttca tttggttcat taattctata aggccataaa 720 
acaggtaata taaaaagctt ccatgattct atttatatgt acatgagaag gaacttccag 780 
gtgttactgt aattcctcaa cgtattgttt cgacagcact aatttaatgc cgatatactc 840 
tagatgaagt tttacattgt tgagctattg ctgttctctt gggaactgaa ctcactttcc 900 
tcctgaggct ttggatttga cattgcattt gaccttttat gtagtaattg acatgtgcca 960 
gggcaatgat gaatgagaat ctacccccag atccaagcat cctgagcaac tcttgattatl020 
ccatattgag tcaaatggta ggcatttcct atcacctgtt tccattcaac aagagcactal080 
cattcattta gctaaacgga ttccaaagag tagaattgca ttgaccgcga ctaatttcaall40 
aatgcttttt attattatta ttttttagac agtctcactt tgtcgcccag gccggagtgcl200 
agtggtgcga tctcagatca gtgtaccatt tgcctcccgg gctcaagcga ttctcctgccl260 
tcagcctccc aagtagctgg gattacaggc acctgccacc atgcccggct aatttttgtal320 
attttagtag agacagggtt tcaccatgtt gcccaggctg gtttcgaact cctgacctcal380 
ggtgatccac ccgcctcggc ctcccaaagt gctgggatta caggcttgag cccccgcgccl440 
cagccatcaa aatgcttttt atttctgcat atgttgaata ctttttacaa tttaaaaaaalSOO 
tgatctgttt tgaaggcaaa attgcaaatc ttgaaattaa gaaggcaaaa atgtaaaggal560 
gtcaaaacta taaatcaagb atttgggaag tgaagactgg aagctaattt gcattaaattl620 
cacaaacttt tatactcttt ctgtatatac attttttttc tttaaaaaac aactatggatl680 
cagaatagcc acatttagaa cactttttgt tatcagtcaa tatttttaga tagttagaacl740 
ctggtcctaa gcctaaaagt gggcttgatt ctgcagtaaa tcttttacaa ctgcctcgacl800 
acacataaac ctttttaaaa atagacactc cccgaagtct tttgttcgca tggtcacaca!860 
ctgatgctta gatgttccag taatctaata tggccacagt agtcttgatg accaaagtccl920 
tttttttcca tctttagaaa actacatggg aacaaacaga tcgaacagtt ttgaagctacl980 
tgtgtgtgtg aatgaacact cttgctttat tccagaatgc tgtacatcta ttttggattg2040 
tatattgtgt ttgtgtattt acgctttgat tcatagtaac ttcttatgga attgatttgc2100 
attgaacaca aactgtaaat aaaaagaaat. ggctgaaaga gcaaaaaaaa aggagggcag2160 
gagagaggaa aaggggagga agaggagggg ggaaagagaa gggagagaga aggaggggga2220 
aggaggtggg ggg 2233 
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(2) INFORMATION OBER SEQ ID NO: 44: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 243 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblieaing und Editieaing 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 44 

ggagcccagc actagaagtc ggcggtgttt ccattcggtg atcagcactg aacacagagg 60 
actcaccatg gagtttgggc tgagctgggt tttcctcgtt gctcttttaa gaggtgtccal20 
gtgtcaggtg cactggtgga gcggggagcg ggtcagcagg agtcctgaat cctgtgacgcl80 
tgatcagtcc tatatcagat ggcgcagctc agcagggtga tggggtatga atgataacat240 
aca 243 



(2) INFORMATION UBER SEQ ID NO: 45: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 817 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 45 

gttttttttt tttttttttg aagagagcag attctcttta ttgagatacg ggacacagcg 60 

aagggtggag agacggaaca gccccccagc ctcagccctc tccacggggg ccggatgccal20 

gagatgggag aagggattca gtctctcgcc cgggaaaccc agtcccacag agggcgccggl80 

caagggtggg acgcgacctg ggtgacacgg tgcagggagt ctttaaatag aggaggggct240 

ggagcgggga aacgcgccgg ggccctagcg caccatgtat tccttgcgct tattgagccg300 

aacttggcag aaagagaagc ctccgaggag gaggtaaagg cctgcagcga tgaaacagtt360 

gtagctgact tgctcgtaaa ggttgtatat gttctggggg ccattctcaa aatctttctc420 

cgtgaaggga acgtcctcaa tcaacacagc ggaatggaca ttgaaaaata ttccgagcat480 

tatcaacatg atcactcccc aggcgctgag gacgatgccg caggcggcca gcttcggccc540 

acagcacagg agcgacgcca taaagaaggg agtcggggat cgccgaggtg caagcgggct600 

cggaaagcgg tgggagaaag cccaggatgc cctcgcaggg gggcagaggg ggcgtggccc660 

cggcctcaac catcccatcc gggggcggca ggcggaaaag gctgggctcc tctcaggact720 

ttcgcgggag acggcgqcgt ctgaaaccaa. aactgctcct ggggaaacct tccttgacct780 

ctgtagctag ggcgtgagta ttggaagagc gagggcc 817 

(2) INFORMATION OBER SEQ ID NO: 46: 



(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1644 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 



(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 46 



gttccggctc 
aacacctgtg 
agctgcagga 
ctgtctctgg 
cagggaaggg 
cgtccctcaa 
agctgagctc 
cctacggtga 
caccgtctcc 
ccagccagat 
actcagtgtg 
caggatgcct 
tgcctagccg 
gtgactgtgc 
accccatctc 



acatgggaaa 
gttcttcctc 
gtcgggccca 
tggctccatc 
cctggagtgg 
gagtcgagtt 
tgtgactgcc 
ctcgagatac 
tcagcatccc 
gggaacgtgg 
acctggagcg 
ccggggacct 
gcaagtccgt 
cctgcccagt 
cctcatgctg 



tactttctga 
ctgctggtgg 
ggactggtga 
agcagtggtg 
attgggtaca 
accatatcag 
gcggacacgg 
tactactacg 
cgaccagccc 
tcatcgcctg 
aaagggacag 
gtacaccacg 
gacatgccac 
tccctcaact 



gagtcctgga 
cagctcccag 
agccttcaca 
gttactactg 
tctattacag 
tagacacgtc 
ccgtgtatta 
gtatggacgt 
caaggtcttc 
cctggtccag 
ggcgtgaccg 
agcagccagc 
gtgaagcact 
ccacctaccc 



cctcctgtgc 
atgggtcctg 
gaccctgtcc 
gagctggatc 
tgggagcacc 
taagaaccag 
ctgtgcgaga 
ctggggccaa 
ccgctgagcc 
ggcttcttcc 
ccagaaactt 
tgaccctgcc 
acacgaatcc 
catctccctc 



ccacccccga ctgtcactgc accgaccggc 



aagaacatga 60 
tcccaggtgc 120 
ctcacctgca 180 
cgccagcacc 240 
tactacaacc 300 
ttctccctga 360 
gagcatctct 420 
gggacccggt 480 
tctgcagcac 540 
cccaggagcc 600 . 
cccacccagc 660 
ggccacacag 720 
cagccaggat 780 
aactccacct 840 
cctcgaggac 900 
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ctgctcttag gttcagaagc gaacctcacg tgcacsctqa cccjgcctgag aga"tgcctca 96.Q 
ggtgtcacct tcacctggac gccctcaagt gggaagagcg ctg'ctcaagg accacctgagl020 
cgtgacctct gtggctgcta cagciytgtcc agtgtcctgc cgggctgtgc cgagccatggl080 
aaccatggga agaccttcac ttgcactgct gcctaccccg agtccaagac cccgctaaccll40 
gccaccctct caaaatccgg aaacacattc cggcccgagg tccacctgct gccgccgccgl200 
tcggaggagc tggccctgaa cgagctggtg acgctgacgt gcctggcacg cggcttcagcl260 
cccaaggacg tgctggttcg ctggctgcag gggtcacagg agctgccccg cgagaagtacl320 
ctgacttggg catcccggca ggagcccagc cagggcacca ccaccttcgc tgtgaccagcl380 
atactgcgcg tggcagccga ggactggaag aagggggaca ccttctcctg catggtgggcl440 
cacgaggccc tgccgctggc cttcacacag aagaccatcg accgcttggc gggtaaaccclSOO 
acccatgtca atgtgtctgt tgtcatggcg gaggtggacg gcacctgcta ctgagccgccl560 
cgcctgtccc cacccctgaa taaactccat gctcccccaa gcaaaaaaaa aaaaaaaaaal620 
aaaaaaaaaa aaaaaaaaaa aaaa 1644 



(2) INFORMATION USER SEQ ID NO: 47: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1 133 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus elnzelnen ESTs durch Assemblierung unci Editieaing 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 47 

ttcagaatga cacatgccaa acagtggttc ttattaaatc 60 

attcagaaat aptgatgctt tgtgtatcta ttttcttctc 120 

tgaatgggga aagacaatca ttgaatacaa aacaaataag 180 
tgatattgca cctttggaca tcggtggtgc tgaccaggaa 240 

agtctgtttc aaataaatga actcaatcta aattaaaaaa 300 

ttctctttgc catttcttct tcttcttttt taactgaaag 360 

tgcacatcta cttgcttaaa ttgtgggcaa aagagaaaaa 420 

gtgcaataca gtttcattaa ctccttcccc cgctccccca 480 

aacactctta cacctgttat ggaaaatgtc aacctttgta 540 

gaaaaataaa aaccataaac atttgcacca cttgtggctt 600 

gaagtttaaa acccaaactt ccaaaggttt aaactacctc 660 

tgatccacat tgttaggtgc tgacctagac agagatgaac 720 

gttcataata caaaggtgct aattaatagt atttcagata 780 

ctagaagaat ttgagaagaa atactcc'tgt attgagttgt 840 

aaaaatttga tttagcattc atattttcca tcttattccc 900 

tttgcccaaa tcttcttcag attcagcatt tgttctttgc 960 

tccatggttc cacagaagct ttgtttcttg ggcaagcagal020 

ttgtatatgt gagatgttta aataaattgt gaaaaaaatgl080 

tccaaaagaa aaaaaaaaaa aaaaaagtcg acc 1133 



atttatctgg gacagacatc 
aaaggttcag atattatcag 
tttaaacaga aaaagacaaa 
ccatcacgcc tgcccttcct 
ttctttgtgg acattggccc 
gaaagaaatt tgaaaaaact 
ctgaatcctt ccatttcttc 
gaaggattga tcagagcattf 
aaaatttgaa tttttttttc 
agaaaaccaa aataaaaatt 
ttgaatatet tccacagagg 
aaaacacttt cccatgagtg 
tgaggtcctt gttttgtttt 
cttgaagaat gttgatggtg 
atcgtgtggt gtatttttta 
aattaaaagt atgcagatta 
cagtctcatt ttcatcttct 
aaaattaaat tgtacctatt 
aaataaagca tgtttggttt 
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(2) INFORMATION UBER SEQ ID NO: 48: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 969 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 

(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vll) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 48 

gaggaggagg gtgtatctcc tttcgtcgga ccgccccttg gcttctgcac tgatggtggg 60 

tggatgagta atgcatccag gaagcctgga ggcctgtggt ttccgcaccc gctgccacccl20 

ccgcccctag cgtggacatt tatcctctag cgctcaggcc ctgccgccat cgccgcagatl80 

ccagcgccca gagagacacc agagaaccca ccatggcccc ctttgagccc ctggcttctg240 
gcatcctgtt gttgctgtgg ctgatagccc ccagcagggc ctgcacctgt gtcccacccc300 

acccacagac ggccttctgc aattccgacc tcgtcatcag ggccaagttc gtggggacac360 

cagaagtcaa ccagaccacc ttataccagc gttatgagat caagatgacc aagatgtata420 

aagggttcca agccttaggg gatgccgctg acatccggtt cgtctacacc cccgccatgg480 

agagtgtctg cggatacttc cacaggtccc acaaccgcag cgaggagttt ctcattgctg540 

gaaaactgca ggatggactc ttgcacatca ctacctgcag tttcgtggct ccctggaaca600 

gcctgagctt agctcagcgc cggggcttca ccaagaccta cactgttggc tgtgaggaat660 

gcacagtgtt tccctgttta tccatcccct gcaaactgca gagtggcact cattgcttgt720 

ggacggacca gctcctccaa ggctctgaaa agggcttcca gtcccgtcac cttgcctgcc780 

tgcctcggga gccagggctg tgcacctggc agtccctgcg gtcccagata gcctgaatcc840 

tgcccggagt ggaagctgaa gcctgcacag tgtccaccct gttcccactc ccatctttct900 

tccggacaat gaaataaaga gttaccaccc agcaaaaaaa aaaaaaaaaa acaagtcgtc960 

gcgtgctgt ' 969 



(2) INFORMATION UBER SEQ ID NO: 49: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 617 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 
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(Hi) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 49 

cctacaccta ccctcccttt gggtttctat tcggaccgcg atgatttgct ttggaaggct 60 
taaccccctt cttccccaaa cttgcccccg gagaaccccc agccttacga 6cctcctcctl20 
gaagatgcaa aaccagcttg ccggccgcgc tctcttccag gacatcaaga agccagctgal80 
agatgagtgg ggtaaaaccc cagacgccat gaaagctgcc atggccctgg agaaaaagct240 
gaaccagggc cttttggatc ttcatgccct gggttctgcc cgcacggacc cccatctctg300 
tgacttcctg gagactcact tcctagatga ggaagtgaag cttatcaaga agatgggtga360 
ccacctgacc aacctccaca ggctgggtgg cccggaggct gggctgggcg agtatctctt420 
cgaaaggctc actctcaagc acgactaaga gccttctgag cccagcgact tctgaagggc480 
cccttgcaaa gtaatagggc ttctgcctaa gcctctccct ccagccaata ggcagctttc540 
ttaactatcc. taacaagcct tggaccaaat ggaaataaag ctttttgatg caaaaaaaga600 
ggagggggga aaaaagc 617 

(2) INFORMATION UBER SEQ ID NO: 50: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 704 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partlelle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 50 



ggggagactc gtcaccaggc gtgcagtggg cactgctggg ctcccccatc ccgtcctaac 60 
ccggaacagc cccgggcagg aggcgtggaa agtcgagggg gtaaaccgcg aatgtgcgttl20 
gtgtaagcca cggcgcaggg tggggcgcgg gcgggacttg ggcgggcggg gtgggcttggl80 
ccgagctggc ctccggggca ccgaccgcta taaggccagt cggactgcga cacagcccat240 
cccctcgacc gctcgcgtcg catttggccg cctccctacc gctccaagcc cagccctcag300 
ccatggcatg ccccctggat caggccattg gcctcctcgt ggccatcttc cacaagtact360 
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ccggcaggga gggtgacaag cacaccctga gcaagaagga gctgaaggag ctgatccaga42C 

aggagctcac cattggctcg aagctgcagg atgctgaaat tgcaaggctg atggaagact480 

tggaccggaa caaggaccag gaggtgaact tccaggagta tgtcaccttc ctgggggcct540 

tggctttgat ctacaatgaa gccctcaagg gctgaaaata aatagggaag atggagacac600 

cctctggggg tcctctctga gtcaaatcca gtggtgggta attgtacaat aaattttttt660 

tggtcaaatt taaaaaaaaa aaaaaaagag aaaaaagggt gage 704 

(2) INFORMATION OBER SEQ ID NO: 51: 

(A) LANGE: 95 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 51: 

PCSSQFHPVE NRSQEPLAGD SMSPRTLPVQ NMNNAMFLQK TLSLSFIGGN HQTTAECRTL60 
SRTTDLSPSH SPYHHKSHNK KEKRYFGFKK SKKIM 95 

(2) INFORMATION UBER SEQ ID NO: 52: 

(A) LANGE: 76 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(il) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vl) HERKUNFT: 

(A) ORGANISMUS: MENSCH • 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 52: 

LPRDTWFKLK CLTDHSRHVL ^HSYVNVSHLT WVHCLQTEHR LPLAWFENRN RAMPTDPSYV60 
WASKWNCTFI QIFTCL 76 

(2) INFORMATION UBER SEQ ID NO: 53: 

(A) LANGE: 90 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
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(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 53: 

RVNNCQEQLV VLKYNPQPRI PPVLQMDQLK QANTEDTKHE VRFIETRVTP LDELNTKMTL60 
TLSRYRSSET CLQNEIPEEF CSYPEIRGSN 90 

(2) INFORMATION UBER SEQ ID NO: 54: 

(A) LANGE: 117 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 54: 

PLQPPRAMAP RGCIVAVFAI FCISRLLCSH GAPVAPMTPY LMLCQPHKRC GDKFYDPLQH 60 
CCYDDAWPL ARTQTCGNCT FRVCFEQCCP WTFMVKLINQ NCDSARTSDD RLCRSVS 117 

(2) INFORMATION UBER SEQ ID NO: 55: 

(A) LANGE: 103 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 55: 

RVPQPALGWC RVDVGHRGHQ EGSESLGPHQ HTHLMLSRIL EGDLWASSGQ RQGGPQTGHR 60 
MKWAVECVFL WPPNSHSASQ ISGNTSLFLQ AHPGRRIQES SFP • 103 

(2) INFORMATION UBER SEQ ID NO: 56: 
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(A) LANGE: 81 Aminosauren - 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 56: 

RCSFHTSGSW PRARRHHHSN SAAGGRRTCP HISCVAGTAS GKESWGPLGL RVSRGAWRCR6 0 
KWQRQLRCSL GEPWLWWAV E 81 

(2) INFORMATION UBER SEQ ID NO: 57: 

(A) LANGE: 125 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 57: 

RAARADSARA FPLPACKVW PQGPPPGHVG AAGQAFPSFE RGFRCRSRAS GLRSSLPSFR 60 
SWASPPPTH QSRCILGRAL GAMAPRGRKR KAEAAWAVA EKREKLANGG EGMEEAT WI 120 
EHCTS • 125 

(2) INFORMATION UBER SEQ ID NO: . 58: 

(A) LANGE: 1 19 Aminosauren 

(B) TYP: Protein . 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(II) MOLEKULTYP: ORF 
iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 58:' 

QRSPPPFPPR RSPASLASRL RRPPRPQPYA SSRGEPWRLE PGRECSGTGG WGAETRPLSG 60 
NWATKSAARK LCSYSGNLSQ RKGKLGPQHP RGLEADLGAQ PLCKQGAGRL EPNRLERLE 119 

(2) INFORMATION ClBER SEQ ID NO: 59: 

(A) LANGE: 128 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 59: 

TRQLVQCSIT TVASSIPSPP FASFSRFSAT ATTAASALRF LPRGAMAPRA RPRMQRDWWV 60 
GGGDATTERK LGNEERSPEA LLLQRKPLSK EGKAWPAAPT WPGGGPWGTT TLQAGSGKAR120 
AESARAAR 128 

(2) INFORMATION UBER SEQ ID NO: 60: 

(A) LANGE: 127 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 60: 

VTVMQFNFEL SFKYVLYSSY SWLKLDHTIA DCMVFTWTPC RMLDYLYSSY AHMLWAGEMK 60 
SSSHQDLLFK WLDNWATKEL ELHLLGFELF WNTLLHFGKS KSSASGALSI ENLPSFALKD120 
VLFFIYT. \21 

(2) INFORMATION UBER SEQ ID NO: 61: 

(A) LANGE: 111 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 
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(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 61 : 

SIGPICSQGL GPGGIPSPIT LIKNGCNCKN PCLIYLQLCS HLQMYLLMLS CQVPMQRWRG 60 
LPLCGWGLWV WKDRYQKNA FKCTNLLINI RCLLKKKKKK KKRVGGVGCI G 111 

(2) INFORMATION UBER SEQ ID NO: 62: 



(A) LANGE: 68 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 62: 

YRSFTTTHKP HPHKGSPRHL CIGTWQLSIR RYICKWEHSC KYIRQGFLQL QPFLIKVIGE60 
GIPPGPRP 68 

(2) INFORMATION UBER SEQ ID NO: 63: 

(A) LANGE: 195 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 



(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 63: 

LVQPGGSCSG GRLLGVEFPS APRVRPFERS APAPATSLLG AMTTTTTFKG VDPNSRNSSR 60 
VLRPPGGGSN FSLGFDEPTE QPVRKNKMAS NIFGTPEENQ ASWAKSAGAK SSGGREDLES 120 
SGLQRRNSSE ASSGDFLDLK GEGDIHENVD TDLPGSLGQS EEKPVPAAPV PSPVAPAPVP18 0 
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SRRNPPGGKS SLVLG 



(2) INFORMATION UBER SEQ ID NO: 64: 

(A) LANGE: 164 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 



(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 64: 

VSQSFPSNLL LENTHAMAHR PKSQGQRETC SSKEKKKRQQ YIKCFFLMKQ IQEMYSQAQV 60 
VQFTSMEETD RTTAFRTVRA NPRRGWTCRQ GDFFWMALGP GPPGWAQAQQ ARASLHSAPG12 0 
CLASLCPHFH EYHLLPSDLR SLRSLLQRSS FSAVQMTPSL PCHH 164 

(2) INFORMATION OBER SEQ ID NO: 65: 

(A) LANGE: 106 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 



(ii) MOLEKULTYP: ORF 



(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 65: 

FQAVSLYIQA FLCVRAKALL ISQPVLLLSG YFLRLKNKRQ FLCFAGGKAG GAGLFIVHMS 60 
QEEALSKGHW QVRATPRRLC GETPCGLGPG RNGACGLFMV CPVEAW 106 

(2) INFORMATION UBER SEQ ID NO: 66: 

(A) LANGE: 349 Aminosauren 

(B) TYP: Protein 

(C) "STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 



(iii) HYPOTHETISCH: ja 
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(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 66: 

AALRSDAGMK RALGRRKGVW LRLRKILFCV LGLYIAIPFL IKLCPGIQAK LIFLNFVRVP 60 
YFIDLKKPQD QGLNHTCNYY LQPEEDVTIG VWHTVPAVWW KNAQGKDQMW YEDALASSHP120 
IILYLHGNAG TRGGDHRVEL YKVLSSLGYH WTFDYRGWG DSVGTPSERG MT YDALHVFD1 8 0 
WIKARSGDNP VYIWGHSLGT GVATNLVRRL CERETPPDAL ILESPFTNIR EEAKSHPFSV2 4 0 
IYRYFPGFDW FFLDPITSSG IKFANDENVK HISCPLLILH AEDDPWPFQ LGRKLYSIAA300 
PARSFRDFKV QFVPFHSDLG YRHKYIYKSP ELPRILREFL GKSEPEHQH 349 

(2) INFORMATION UBER SEQ ID NO: 67: 

(A) LANGE:191 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 67: 

SGLSRLGPGR NQHAGQDVLC EVAAALHQVL KELLGQGIDY EKILKLTADA KFESGDVKAT 60 

VAVLSFILSS AAKHSVDGES LSSELQQLGL PKEHAASLCR CYEEKQSPLQ KHLRVCSLRM120 

NRIiAGVGWRV DYTLSSSLLQ SVEEPMVHLR LEVAAAPGTP AQPVAMSLSA DKFQVLLAEL1 8 0 

KQAQTLMSSL G 191 

(2) INFORMATION UBER SEQ ID NO: 68: 

(A) LANGE: 164 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 



(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 68: 

FFFFFFFFFF FFFFSLLYFC LFFLLMKTAN NCLSREGKVM LGKVLRSPEP SSQERSEAAG 60 
DLGGQSPGQG LSILEPGLPP EEQFRGRDSI RAGRLHTGLE HPSPQPRELI RVWACFSSAR1 20 
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RTWNLSAERD MATGWAGVPG AAATSSRRCT MGSSTDC&RL: ELRV 164 

(2) INFORMATION UBER SEQ ID NO: 69: 

(A) LANGE: 155 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 
I) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 69: 

NQGCLPKSSS EGVTPYGQGG STQAWNTLLL SPGSSSGSGP ASVLPGGPGT CLLRGTWQQA 60 
GLGSLGQLPP PAAGAPWALP RIAAGWSSGC SPPASPHLPT YSCVGCRPAS ASARGFASPH120 
NSGTGWPRAL WAAPAAAVHW TRIRHRHCAW PflWRG 155 

(2) INFORMATION OBER SEQ ID NO: 70: 

(A) LANGE: 35 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
HYPOTHETISCH: ja 



(vl) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 70: 

RRAAVTWVWL GVLCFESAVF TPTEWRTCR LLRFS 35 

(2) INFORMATION UBER SEQ ID NO: 71 : 

(A) LANGE: 32 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 
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(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 71: 

KRLTQNTTPP TRPKSQLHVF KTSFKVSYFS IS 

(2) INFORMATION OBER SEQ ID NO: 72: 

(A) LANGE: 37 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 72: 

ENRSNLHVLT TSVGVKTADS KHNTPNQTQV TAARLQN 

(2) INFORMATION UBER SEQ ID NO: 73: 

(A) LANGE: 121 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 73: 

LVKGMTVLEA VLEIQAITGS RLLSMVPGPA RPPGSCWDPT QCTRTWLLSH TPRRRWISGL 60 
PRASCRLGEE PPPLPYCDQA YGEELSIRHR ETWAWLSRTD TAWPGAPGVK QARILGELLL120 
V 121 

(2) INFORMATION UBER SEQ ID NO: 74: 

(A) LANGE: 115 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 



107 



DE 198 18 619 A 1 



(ii) MOLEKOLTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 74: 

QACPWASLAQ GQRTRLRRKL DTPVHGGI/3L EGWLSGLEVP GGLPAGTRPS AAGWAVPCCC 60 
CPQGLAWAE DGTLSGWIRS PGSSSSRELR HKAGARLYTC RTQESLLQFL PEAPR 115 

(2) INFORMATION UBER SEQ ID NO: 75: 

(A) LANGE: 117 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 75: 

RWHLIRLDQV TRQQQLSRAE AQGRGPAVHL QDPGEPVAVL ARSAEIASSV SLQQEQNQLW 60 
PRWVGGSAFL AMAAATPRQE TAECLEGCNT RSNRQPPL^L MSDGQALQHL DRHGGWS 117 

(2) INFORMATION UBER SEQ ID NO: 76: 

(A) LJWGE: 66 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 76: 

PPQERRTIFV LYPRGSGREN MESGFYRLIG PIHKGHDWEK VWEQKENWDF RVQYAHPKLL60 
VAWGMS 66 

(2) INFORMATION OBER SEQ ID NO: 77: 
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(A) LANGE: 81 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 77: 

ALSTRAMTGK RYGSRRRIGI LGCSTLTLNF WWPGACLEAQ TVKQALLACL LVTTSAPAVL60 
RLHPAPGTPP APEPPLSPCD G 81 

(2) INFORMATION UBER SEQ ID NO: 78: 

(A) LANGE: 104 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 78: 

TLLTIHIWTR DTVHEVHPSQ GDSGGSGAGG VPGAGWSLKT AGAEWTSKQ ASRACLTVWA 60 
SRHAPGHQKF RVSVLHPKIP ILLLLPYLFP VMALVDRAYQ SIES 104 

(2) INFORMATION UBER SEQ ID NO: .79: 

(A) LANGE: 104 Aminosauren 

(B) TYP: Protein . 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 79: 

PSSPRAVRHS GHQDSKMASV VPVKDKKLLE VKLGELPSWI LMRDFSPSGI FGAFQRGYYR 60 
YYNKYINVKK GSISGITMVL ACYVLFSYSF SYKHLKHERL RKYH 104 

(2) INFORMATION UBER SEQ ID NO: 80: 

(A) LANGE: 82 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 80: 

RRGASRGLPW CWHATCSLAT PFPTSISSTS GSANTTEEDT LCTPPPHDLG PSPSVRNTIS60 
IVAESFHILI GINLQIKHDW YV 82 

(2) INFORMATION UBER SEQ ID NO: 81: 

(A) LANGE: 115 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 81: 

KDSATIEIVF LTEGLGPRSW GGGVQSVSSS VVFAEPLVLE MLVGKGVAKE HVACQHHGNP 60 
RDAPLLHIDV LVWPWTSU KRSENATRTE VPHQDPAWQL PQFDLQKFLV LHWYN 115 

(2) INFORMATION UBER SEQ ID NO: 82: 

(A) LANGE: 187 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
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b) eine allclische Variation der unter a) genannten Nuklcinsaure-Sequenzen 
dder 

c) eine Nukleinsaure-Sequenz, die komplementar zu den unler a) oder b) genannten Nuklcinsaure-Sequenzen 
ist. 

2. Eine Nukleinsaure-Sequenz gemaB einer der Scquenzen Seq. ID Nos. 1-50, oder eine komplementare oder aUe- 5 
lische Variante da von. 

3. Nukleinsaurc-Sequenz Seq. ID No. 1 bis Seq. ID No. 50, dadurch gekennzeichnet, daB sie in Blasentumorgc- 
webe crhoht exprimicrt sind. 

4. BAG, PAC und Cosmid-Klonc, enlhaltend funktionclle Gene und ihre chromosomale Lokalisation, entsprechend 
den Scquenzen Seq. ID. No. 1 bis Seq. ID No. 50, zur Verwcndung als Vchikel zum Gentransfer. 10 

5. Eine Nukleinsaure-Sequenz gemaB den Anspriichen 1 bis 4, dadurch gekennzeichnet, daB sie eine 90%ige Ho- 
mologie zu einer humancn Nukleinsaure-Sequenz aufweist. 

6. Eine Nukleinsaure-Sequenz gemaB den Anspriichen 1 bis 4, dadurch gekennzeichnet, daB sie eine 95%ige Ho- 
mologie zu einer human en Nukleinsaure-Sequenz aufweist. 

7. Eine Nukleinsaure-Sequenz, umfassend einen Teil der in den Anspriichen 1 bis 6 genannten Nukleinsaure-Se- 15 
quenzen, in solch einer ausreichenden GroBc, daB sie mil den Scquenzen gemaB den Anspriichen 1 bis 6 bybridisie- 
ren. 

8. Ein Nukleinsaure-Sequenz gemaB den Anspriichen 1 bis 7, dadurch gekennzeichnet, daB die GroBe des Frag- 
ments eine Lange von mindestens 50 bi s 4500 bp aufweist. 

9. Eine Nukleinsaurc-Sequenz gemaB den Anspriichen 1 bis 7, dadurch gekennzeichnet, daB die GrdBe des Frag- 20 
ments eine Lange von mindestens 50 bis 4000 bp aufweist. 

10. Eine Nukleinsaure-Sequenz gemaB einem der Anspriiche 1 bis 9, die mindestens eine Ttilsequenz eines biolo- 
gisch aktiven Polypeptids kodiert. 

1 1 . Eine Expressionskassette, umfassend ein Nukleinsaure-Fragrnent oder eine Sequenz gemaB einem der Ansprii- 
che 1 bis 9, zusammen mit mindestens einer Kontroll- oder regulatorischen Sequenz. 25 

12. Eine Expressionskassette, umfassend ein Nukleinsaure-Fragrnent oder eine Sequenz gemaB Anspruch 11, wo- 
rm die Kontroll- oder rcgulalorische Sequenz ein geeigneter Promoter ist. 

13. Bine Expressionskassette gemaB einem der Anspriiche 11 und 12, dadurch gekennzeichnet, daB die au f der Kas- 
serte befindlichen DNA-Sequenzen ein 

Fusionsprotein kodieren, das ein bekanntes Protein und ein biologiscb aktives Polypeptid-Fragment umfafit. 30 

14. Verwendung der Nuklcinsaure-Sequenzen gemaB den Anspriichen 1 bis 10 zur HersteUung von \bllangen-Ge- 
nen. 

15. Ein DNA-Fragmcnt, umfassend ein Gcn f das aus der Verwendung gemaB Anspruch 14 erhaltlich ist. 

16. Wirtszelle, enthaitend als heterologen Teil ihrer exprimicrbaren generischen Information ein Nukleinsaure- 
Fragrnent gemaB einem der Anspriiche 1 bis 10. 35 

17. Wirtszelle gemaB Anspruch 1 6, dadurch gekennzeichnet, daB cs ein prokaryontisches oder eukaryontischc Zell- 
system ist. 

18. Wirtszelle gemaB einem der Anspriiche 16 oder 17, dadurch gekennzeichnet, daB das prokaryontischc Zellsy- 
stem E. coli und das eukaryontische Zellsystem ein tierisches, humanes oder Hefe-ZeUsystem ist 

19. Ein Verfahren zur Hcrstellung eines Polypeptids oder eines Fragments, dadurch gekennzeichnet, daB die Wirts- 40 
zellen gemaB den Anspriichen 16 bis 18 kultiviert werden. 

20. Ein Antikorper, der gegen ein Polypeptid oder ein Fragment gerichtet ist, welches von den Nukleinsauren der 
Sequenzen Seq. ID No. 1 bis Seq. ID No. 50 kodiert wird, das gemaB Anspruch 19 erhaltlich ist. 

21. Ein Antikorper gemaB Anspruch 20, dadurch gekennzeichnet, daB er monoklonal ist. 

22. Ein Antikorper gemaB Anspruch 20 dadurch gekennzeichnet, daB er ein Phage-Display-Anukorper ist 45 

23. Polypeptid-Tcilsequenzen, gemaB den Sequenzen Seq. ID Nos. ORF 5 1-106. 

24. Porypeptid-Teilscquenzen gemaB Anspruch 22, mit mindestens 80%iger Homologie zu diescn Sequenzen. 

25. Ein aus einem Phage-Display hervorgegangenen Polypeptid, welches an die Polypeptid-leilsequenzen gemaB 
Anspruch 24 binden kann. 

26. Polypeptid-Teilsequenzen gemaB Anspruch 22, mit mindestens 90%iger. Homologie zu diesen Sequenzen. 50 

27. Verwendung der Polypeptid-Teilsequenzen gemaB den Scquenzen Seq. ID No. 51-106, als Tools zum Auffin- 
den von WirkstofTen gegen den Blasentumor. 

28. Verwendung der Nukleinsaure-Scquenzen gemaB den Sequenzen Seq. ID No. 1 bis Seq. ID No. 50 zur Expres- 
sion von Polypeptiden, die als Tools zum Auffinden von WirkstofTen gegen den Blasentumor verwendet werden 
konncn. 55 

29. Verwendung der Nuklcinsaure-Sequenzen Seq. ID No. 1 bis Seq. ID No. 50 in sense oder antisense Form. 

30. Verwendung der Polypeptid-Teilsequenzen Seq! ID No. 51—106 als Arzneimittel in der Gentberapie zur Be- 
handlung des Blasentumors. 

31. Verwendung der Polypepud-leilsequcnzen Seq. ID No. 51-106, zur Hcrstellung eines Arzncimittels zur Be- 
handlung gegen den Blasentumor. 60 

32. Arzneimittel, enthaitend mindestens cine Polypeptid-Teilsequenz Seq. ED No. 51-106. 

33. Eine Nukleinsaure-Sequenz gemaB den Anspriichen 1 bis 10, dadurch gekennzeichnet, daB cs eine genomische 
Sequenz ist. 

34. Eine Nukleinsaure-Sequenz gemaB den Anspriichen 1 bis 10, dadurch gekennzeichnet, daB es eine mRN A- Se- 
quenz ist. 65 

35. Genomische Gene, ihre Promotoren, Enhancer, Silencer, Exonstruktur, Intronstruktur und dcren SpleiBvarian- 
ten, erhaltlich aus den cDNAs der Scquenzen Seq. ID No. 1 bis Seq. ID No. 50. 

36. Verwendung der genomischen Gene gemaB Anspruch 33, zusammen mit geeigneten regula riven Elcmenten. 
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37. Verwendung gemaB Anspruch 34, dadurch gekennzeichnct, daB das regulative Element ein gceigneter Promo- 
tor und/oder Enhancer ist. 

38. Einc Nuklcirisaurc-Scquenz gemaB den Anspriichen 1 bis 7, dadurch gekennzeichnct, dafi die GroBe des Frag- 
ments eine Lange von mindestens 300 bis 3500 bp aufweist. 

Hierzu lOSeite(n) Zeichnungen 



122 



ZEICHNUNGEN SEITE 1 



Nummer. 
Int. CI. 6 : 

Often leg ungstag: 
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C07K 16/00 
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Systematische Gen-Suche in der Incyte UfeSeq Daterv 

banK 




\ J 

Gene von Interesse 



Fig. 1 
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Nummer: 
Int CI. 6 : 

Offcnlogungstag: 



DE19818619A1 
C07K 16/00 

28. Oktobor 1999 



Prinzip der EST-Assemblierung 

-50.000 ESTs pro Gewebe 



Contigs 




Assemblierung bei 0% Mismatch 
mit GAP4 (Staden) 



Singletons 



In Anzahl und Lange 
zunehmende Contigs , 



Iterative Assemblierung 

mit steigendem 
Mismatch (1%,2%,4%) 



5000-6000 Contigs -25.000 iibrige Singletons 

\/ 

-30.000 Konsensus- 
sequenzen pro Gewebe 

Fig. 2a 
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ZEICHNUNGEN SEITE 3 Nummen DE19818 619A1 

lnt.CI. 6 : C 07 K 16/00 

Offonlcgungstag: 28. Oktober 1999 



-50.000 ESTs 
eines Gewebes 
(z.B.: Blase Tumor) 



GAP4 assembly 1 . Runde: 

minimum initial match: 20 
maximum pads per read: 8 
maximum percent mismatch: 0 



nicht 
assemblierte 
ESTs 




GAP4 assembly 3. Runde: 

minimum Initial match: 20 
maximum pads per read: 8 
maximum percent mismatch: 2 




fr 

GAP4-Datenbank 3: 
Conrigs 3 Singletons 3 



(r 



% 



nicht 
assemblierte 
ESTs 



Fig. 2b1 
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Fig. 2b2 
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f'f A 

Consensus 3 



Consensus 4 




Consensus 5 



Singletons 5 




V 



GAP4 assembly 6. Runcle: 

minimum initial match: 20 
maximum pads per read: 8 
maximum percent mismatch: 4 




assemblierte Datenbank 
eines spezlflschen Gewebes 
(z.B.: Blase Tumor) , ■ 



Fig. 2b3 
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* assemblierte Datenbank 
elnes spezifischen Gewebes 
(z.B.: Blase Tumor) 



V 



Consensus 6 



V 



Einlesen als Singletons 



v 



Datenbank elnes 
spezifischen Gewebes 
(z.B.: Blase Tumor) 



Datenbank eines zweiten 
spezifischen Gewebes 
(z.B.: Blase Normal) 



GAP4 assembly 

minimum Initial match: 20 
maximum pads per read; 8 
maximum percent mismatch: 4 




Tumor-Gewebs 
spezlflsche 



nicht Gewebs- 
spezifische 
ESTs 



i 



Normal-Gewebs 
spezifische 

u ESTs // 



Fig. 2b4 
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Nummer: 
Int. CI. 6 : 

Offcnlegungstag: 
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28. Oktobcr 1999 




Gene von Interesse 



Bestimmung der 
gewebsspezifischen 
Expression uber 
elektronischen Northern 
( INCYTE LifeSeq und 
offentliche EST 
Datenbanken) 




Kandidatengene fur 
Tumorsuppressoren oder 
Tumorakfivatoren 

\ : ) 



Fig. 4a 
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Nummen 
Int CI. 6 : 

Offcnlogungstag: 



DE19818 619A1 
C07K ItyDO 

28. Oktober 1999 



Fartielle 
cDNA 
Sequenz z.B. 
ESTo.Contig/ 



GCCTCAAGTTATC 



WHILE Q > Q.j 



Elektronischer Northern-Blot 



Fishers Exakter Test 



IF Ho 



► EXIT 



Automatische Verlangerung 




ATGTCCTA GCCTCAAGTTATC AGATGCAA, 

i ~ ~~ ~~ 



Fig. 4b 
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ZEICHNUNGEN SEI7E 10 Nummer: DE198 18619 A1 

IntCI. 6 : C07K iBJOO 

Offenlegungslag: 28. Oktober 1999 




Chromosomale Klon-Lokalisation uber FISH 



ii i ii i ■■ i bi i » 



Hybridisierungssignal 




Sequenzierung von Klonen, die in Regionen 
lokalisiert sind, die chromosomale Deletionen 

in Prostata- und Brustkrebs aufweisen, 
fiihrt zur Identifizierung von Kandidatengenen 



Exon Intron 



Bestatigung der Kandidatengene durch 
Screening von Mutationen und/oder 
Deletionen in Krebsgeweben 



Fig. 5 
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(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 82: 

ARAARGARRT SRAVTPTCAT PAGPMPCSRL PPSLRCSLHS ACCSGDPASY RLWGAPLQPT 60 
LGWPQASVP LLTDLAQWEP VLVPEAHPNA SLTMYVCTPV PHPDPPMALS RTPTRQISSS120 
DTDPPADGPS NPLCCCFHGP AFSTLNPVLR HLFPQEAFPA HPIYDLSQVW SWS PAPSRG 180 
QALRRAQ 187 

(2) INFORMATION UBER SEQ ID NO: 83: 

(A) LANGE: 241 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 83: 

FFFFGLSNRC LLRAYAVLRL PFREPHECEA WPLPPGLQAP SLETPRNSRR LLSSSSTQST 60 
SSQPLLGPPE CLSPAGCGGH HGPDLAQVID GVGREGFLGE EVPEHRVKGG ECWAMETAAE120 
RVGGAICRRI CVTRADLPGG SPGEGHGRVR VGHRGADIHG QTCVRMCLRN QORLPLGQVC1 8 0 
EEWHRGLGHH TQCGLQRGPP EPIAGRVPRA AGRVQGAAQG WRQPAAGHGP RWRCTS RCHS 2 4 0 
T . 241 

(2) INFORMATION UBER SEQ ID NO: 84: 

(A) LANGE: 113 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 



(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 84: 
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MGWAGKASWG KRCLSTGLRV ENAGPWKQQQ RGLEGPSACG' 3VSLEI»ICPV GVLERAMGGS 60 
GWGTGVQTYM VRLALGCASG TRTGSHWARS VRSGTEAWG?" TPSVGOSGAP QSL 113 

(2) INFORMATION OBER SEQ ID NO: 85: 

(A) L^NGE:107 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEK0LTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 85: 

AFLSFLFSER FKASTTLFPP SLLNLICTKS FALVGWETA LSLSTSVREC EPPWQVPVQG 60 
PAALHLGRVT GAPAVCPBCAS PWPFGLSLGR FRTEHQGRQA FQGISIN 107 

(2) INFORMATION UBER SEQ ID NO: 86: 

(A)U\NGE:107 Aminosauren 
(B.)TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEK0LTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(») SEQUENZ-BESCHREIBUNG: SEQ ID NO 86: 

LRNRLWKVKL EEPDLISPTS KTPSEQQRPQ HPPRTGDSIF MATPCGGRLT TSHHIIPELS 60 
SSSGMTPSPP PPSSSFSSFC LFVSELSCLS FFLRDSKPPR LCFPRPF 107 

(2) INFORMATION UBER SEQ ID NO: 87: 

(A) LANGE:115 Aminosauren 

(B) TYP: Protein 

(C) "STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 

(iii) HYPOTHETISCH: ja 
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(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 87: 

IQKGRGKQSR GGFESLRKKE RQESSETKRQ KDEKEEEGGG GDGVZPLEEL SSGMMWWLW 60 
NRPPQGVAMK MESPVRGGCC GRCCSDGVFD VGLMRSGSSS FTFQSRFLSQ WQWL 115 

(2) INFORMATION UBER SEQ ID NO: 88: 



(A) LANGE: 124 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 88: 

CSVRNLPRLR PKGQGEAFGH TAGAPVTRPR WRAAGPCTGT CQGGSHSRTL VLRLKAVSTT 60 
PTSANDLVQI KFRRDGGNRV VEALNLSEKR KDRKAQKQRD RRMKKKKREV VGTASSRWRS120 
SALG 124 

(2) INFORMATION UBER SEQ ID NO: 89: 

(A) LANGE: 198 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 



(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 89: 

EGAGGEWRCP AAGGPRGEDG PPGLRLTERA GLTTRHLTGT ADPSQKHNLR DPSSSKDERI 60 
QQTQPNREGA CSLNLHGLRY LCKPGVAPVL RVILRSCLFP NGFTSGSCRF SLGLSLILKW120 
VAGGRGWLRP LLALRLQSWE QDTSPEFHFF SCPNHAHTIV QNQSTFEKWL HGHPPGPRKL180 
HSKGLFTWQQ NPSPAVSP 198 

(2) INFORMATION UBER SEQ ID NO: 90: 
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(A) LANGE: 124 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 90: 

LPPVEPPVLK GSCRKDMHRN EERERSENEV WRARPGPTAQ GSSPPPDAPF HPPPQCLLSP 60 
KAPASEVLGA HPPQCGQGGK GQVLDTAKCP EMTLLLTHFF GPWQSPTCPQ HGAPGRTGRQ120 
EGGW 124 

(2) INFORMATION OBER SEQ ID NO: 91: 



(A) LANGE: 147 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzet 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 91: 

NSGEVSCSQL CSLRASRGRS HPLPPATHFK MRLRPRLKRQ LPEVNPFGKR HERRMTLRTG 60 
ATPGLHKYRR PWRLRLQAPS LLGCVCCILS SLELLGSLRL CFWDGSAVPV RCLWRPALS 120 
VSLSPGGPSS PLGPPAAGHL HSPPAPS 147 

(2) INFORMATION UBER SEQ ID NO: 92: 



(A) LANGE: 374 Aminosauren 

(B) TYP: Protein ' 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(II) MOLEKULTYP: ORF 
(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 92: 

SREAPESRRW AVWRSLESLP RHQLLCLPVG APPAPAMLSA LARPASAALR RSFSTSAQNN 60 
AKVAVLGASG GIGQPLSLLL KNSPLVSRLT LYDIAHTPGV AADLSHIETK AAVKGYLGPE120 
QLPDCLKGCD VWIPAGVPR KPGMTRDDLF NTNATIVATL TAACAQHCPE AMI CVI ANPV1 8 0 
NSTIPITAEV FKKHGVYNPN KIFGVTTLDI VRANTFVAEL KGLDPARVNV PVIGGHAGKT2 4 0 
IIPLISQCTP KVDFPQDQLT ALTGRIQEAG TEWKAKAGA GSATLSMAYA GARFVFSLVD300 
AMNGKEGWE CSFVKSQETE CTYFSTPLLL GKKGIEKNLG IGKVSSFEEK MI S DAI PELK3 6 0 
ASIKKGEDFV KTLK 374 

(2) INFORMATION OBER SEQ ID NO: 93: 

(A) LANGE: 238 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEK0LTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 93: 

LNEGTFHNTF LSIHCIHKGE DKAGAGIRHG EGGRACSGFS LDHLRAGLLD PPSECCQLVL 60 
GKVHLGGALR DQGDDGLPSM ATNDRDVDSS WIQTLQLCNK GVGSDDVQGR HAEDFVGWH120 
SMLLENFCCD GDGGINRIGN DADHGFRAVL GTGSGQGGHN RGIGVEQWP GHAWLSGDSS180 
RNNYHITTFQ AVRQLFRSEV AFHSGFGLDV AQICGHSGCV RDIIEGQAAH QGAVLQEK 238 

(2) INFORMATION UBER SEQ ID NO: 94: 

(A) LANGE: 242 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 94: 

EAGEEKAEEE GVAEEEGVNK FSYPPSHREC CPAVEEEDDE EAVKKEAHRT STSALSPGSK 60 

PSTWVSCPGE EENQATEDKR TERSKGARKT SVSPRSSGSD PRSWEYRSGE ASEEKEEKAH1 2 0 

KETGKGEAAP GPQSSAPAQR PQLKSWWCQP SDEEEGEVKA LGAAEKDGEA ECPPCIPPPS180 

AFLKAWVYWP GEDTEEEEDE EEDEDSDSGS DEEEGEAEAS SSTPATGVFL" KSWVYQPGED240 

TQ 242 
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(2) INFORMATION UBER SEQ ID NO: 95: 

(A) LANGE: 237 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 95: 

RPGCIGQERT LDSEDKEDDS EAALGEAESD PHPSHPDQRA HFRGWGYRPG 
DWGEAEFCPF RVAIYVPGEK PPPPWAPPRL PLRLQRRLKR PETPTHDPDP 
FSEKVTVHFL AVWAGPAQAA RQGPWEQLAR DRSRFARRIT QAQEELSPCL 
RLRNPPLAPI PALTQTLPSS SVPSSPVQTT PLSQAVATPS RSSAAAAAAL 

(2) INFORMATION UBER SEQ ID NO: 96: 

(A) LANGE: 890 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 96: 

QDEHLITFFV PVFEPLPPQY FIRWSDRWL SCETQLPVSF RHLILPEKYP PPTELLDLQP 60 
LPVSALRNSA FESLYQDKFP FFNPIQTQVF NTVYNSDDNV FVGAPTGSGK TICAEFAILR120 
MLLQSSEGRC VYITPMEALA EQVYMDWYEK FQDRLNKKW LLTGETSTDL KLLGKGNI I 1180 
STPEKWDILS RRWKQRKNVQ NINLFWDEV HLIGGENGPV LEVICSRMRY I SSQI ERPI R2 4 0 
IVALSSSLSN AKDVAHWLGC SATSTFNFHP NVRPVPLELH IQGFNISHTQ TRLLSMAKPV3 0 0 
YHAITKHSPK KPVIVFVPSR KQTRLTAIDI LTTCAADIQR QRFLHCTEKD LI PYLEKLSD3 60 
STLKETLLNG VGYLHEGLSP MERRLVEQLF SSGAIQVWA SRSLCWGMNV AAHLVIIMDT4 20 
QYYNGKIHAY VDYPIYDVXQ MVGHANRPLQ DDEGRCVIMC QGSKKDFFKK FLYEPLPVES4B0 
HLDHCMHDHF NAEIVTKTIE NKQDAVDYLT WTFLYRRMTQ NPNYYNLQGI SHRHLSDHLS5 4 0 
ELVEQTLSDL EQSKCISIED EMDVAPLNLG MIAAYYYINY TTIELFSMSL KAKTKVRGLI 600 
EIISNAAEYE NIPIRHHEDN LLRQLAQKVP HKLNNPKFND PHVKTNLLLQ AHLSRMQLSA660 
ELQSDTEEIL SKAIRLIQAC VDVLSSNGWL SPALAAMELA QMVTQAMWSK DSYLKQLPHF7 2 0 
TSEHIKRCTD KGVESVFDIM EMEDEERNAL LQLTDSQIAD VARFCNRYPN I ELS YEWDK7 8 0 
DSIRSGGPW VLVQLEREEE VTGPVIAPLF PQKREEGWWV VIGDAKSNSL I SI KRLTLQQ8 4 0 
KAKVKLDFVA PATGAHNYTL YFMSDAYMGC DQEYKFSVDV KEAETDSDSD 890 

(2) INFORMATION UBER SEQ ID NO: 97: 



KETEEEEAAE 60 
ETPLKARKVR120 
T PAARARAWA1 8 0 
DLSGRRG 237 
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(A) LANGE: 281 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 97: 

GDGSAEHGPR PLAAPLVTSR GAPASARPRG ALPGGSAPSA PHGQLPGRAQ PAPVSGPPPT 60 

SGLCHFDPAA PWPLWPGPWQ LPPHPQDWPA QPDIPQDWVS FLRSFGQLTL CPRNGTVTGK120 

WRGSHWGLL TTLNFGDGPD RNKTRTFQAT VLGSQMGLKG SSAGQLVLIT ARVTTERTAG1 8 0 

TCLYFSAVPG ILPSSQPPIS CSEEGAGNAT LSPRMGEECV SVWSHEGLVL TKLLT SEELA240 

LCGSRLLVLG SFLLLFCGLL CCVTAMCFHP RRESHWSRTR L 281 

(2) INFORMATION LIBER SEQ ID NO: 98: 

(A) LANGE: 206 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 98: 

RLEPRSVTRS RRAVSRLSAR PGKVSAVMAF LASGPYLTHQ QKVLRLYKRA LRHLESWCVQ 60 
RDKYRYFACL MRARFEEHKN EKDMAKATQL LKEAEEEFWY RQHPQPYIFP DSPGGTSYER120 
YDCYKVPEWC LDDWHPSEKA MYPDYFAKRE QWKKLRRESW EREVKQLbEE. TPPGGPLTEA180 
LPPARKEGDL PPLWWYIVTR PRERPM 206 

(2) INFORMATION UBER SEQ ID NO: 99: 

(A) LANGE: 139 Aminosauren 

(B) -TYP: Protein 

. (C) STRANG: einzel 
(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 
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(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 99: 

PLVPSFPSAV SSTVLSWQSN QDTLPSQKDA SHLSTILGPC SNRISHRRCP QESQGRCMAV 60 
DADGTRILPR PPSAAGWPSP YPFHSYVLQT GLSSNKQSIG ICLSGRTTTR GGVAPAYKAA1 2 0 
TPFADGSGRV PTPRTPLRR 139 

(2) INFORMATION UBER SEQ ID NO: 100: 

(A) LANGE: 79 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 100: 

APFWDLVAIV SLIGGAPRRV REDWfLWMLT VPEFFLGLLQ QLGGLRHILF ILMFFKPGSH60 
QTSKVSVFVS LDAPRLEVA 79 

(2) INFORMATION UBER SEQ ID NO: 101: 

(A) LANGE: 89 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 101: 

VGGACAVALP QAAfiMAGQED PVQREIHQDW ANREYIEIIT SSIKKIADFL NSFDMSCRSR60 
LATLNEKLTA LERRIEYIEA RVTKGETLT "89 

(2) INFORMATION UBER SEQ ID NO: 102: 
(A) LANGE: 88 Aminosauren 
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(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 102: 

NSAVLLLGSC FTQHRPHGKG PSSLQLLPFS LKSNRAYSCF SFFKSVAFGL CHLGCGWCG60 
KKFRGTVGND VRHFTFSVTF YTSTCQCI 88 

(2) INFORMATION UBER SEQ ID NO: 103: 

(A) LANGE: 89 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 103: 

HFIHLLVNVF ETFTAKSLGL FVKVLLTSIF LSLSLKLSLK FSLPLHCFCE QSLSPPHLWW60 
EVRGSPGQDT HALAMWLPEN Wi^JPPVSC 89 



(2) INFORMATION UBER SEQ ID NO: 104: 

(A) LANGE: 240 Aminosauren . 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 



(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 104: 

REQILFIEIR DTAKGGETEQ PPSLSPLHGG RMPEMGEGIQ SLARETQSHR GRRQGWDATW 60 
VTRCRESLNR GGAGAGKRAG ALAHHVFLAL IEPKLAEREA SEEEVKACSD ETWADLLVK1 2 0 
WYVLGAILK IFLREGNVLN QHSGMDIEKY SEHYQHDHSP GAEDDAAGGQ LRPT AQERRH 180 
KEGSRGSPRC KRARKAVGES PGCPRRGAEG AWPRPQPSHP GAAGGKGWAP LRTFAGDGAV240 

(2) INFORMATION UBER SEQ ID NO: 105: 

(A) LANGE: 136 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 1 05: 

RLYMFWGPFS KSFSVKGTSS INTAEWTLKN IPSIINMITP QALRTMPQAA SFGPQHRSDA 60 
IKKGVGDRRG ASGLGKRWEK AQDALAGGQR GRGPGLNHPI RGRQAEKAGL LSGLSRETAP120 
SETKTAPGET FLDLCS 136 

(2) INFORMATION 0BER SEQ ID NO: 106: 

(A) LANGE: 173 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja. 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 106: 

LQRSRKVSPG AVLVSDGAVS. RESPERSPAF SACRPRMGWL RPGPRPLCPP ARASWAFSHR 60 
FPSPLAPRRS PTPFFMASLL CCGPKLAACG IVLSAWGVIM LIHLGIFFNV HSAVLIEDVP120 
FTEKDFENGP QNIYNLYEQV SYNCFIAAGL YLLLGGFSFC QVRLNKRKEY MVR 173 



Patcntansprucbc 

] . Einc Nukleinsaure-Sequenz, die ein Genprodukt odcr ein Tfcil davon kodiert, umfassend 

a) eine Nukleinsiiure-Sequenz, ausgewahlt aus derGruppe Seq. ID No. 2—5, 7-13, 16, 18, 20, 23, 26-27, 
31-32,36,45. 
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ABSTRACT 

Genetic changes underlie tumor progression and may lead to cancer- 
specific expression of critical genes. Over 1100 publications have de- 
scribed the use of comparative genomic hybridization (CGH) to analyze 
the pattern of copy number alterations in cancer, but very few of the genes 
a fleeted are known. Here, we performed high-resolution CGH analysis on 
cDNA micro arrays in breast cancer and directly compared copy number 
and mRNA expression levels of 13,824 genes to quantitate the impact of 
genomic changes on gene expression. We identified and mapped the 
boundaries of 24 independent amplicons, ranging in size from 0.2 to 12 
Mb. Throughout the genome, both high- and low-level copy number 
changes had a substantial impact on gene expression, with 44% of the 
highly amplified genes showing overexprcssion and 10.5% of the highly 
ove rex pressed genes being amplified. Statistical analysis with random 
permutation tests identified 270 genes whose expression levels across 14 
samples were systematically attributable to gene amplification. These 
included most previously described amplified genes in breast cancer and 
many novel targets for genomic alter ations, including the BOXB7 gene, 
the presence of which in a novel amplicon at I7q213 was validated in 
10.2% of primary breast cancers and associated with poor patient prog- 
nosis. In conclusion, CGH on cDNA microarrays revealed hundreds of 
novel genes whose overexpression is attributable to gene amplification. 
These genes may provide insights to the clonal evolution and progression 
of breast cancer and highlight promising therapeutic targets. 

INTRODUCTION 

Gene expression patterns revealed by cDNA microarrays have 
facilitated classification of cancers into biologically distinct catego- 
ries, some of which may explain the clinical behavior of the tumors 
(1-6). Despite this progress in diagnostic classification, the molecular 
mechanisms underlying gene expression patterns in cancer have re- 
mained elusive, and the utility of gene expression profiling in the 
identification of specific therapeutic targets remains limited. 

Accumulation of genetic defects is thought to underlie the clonal 
evolution of cancer. Identification of the genes that mediate the effects 
of genetic changes may be important by highlighting transcripts that 
are actively involved in tumor progression. Such transcripts and their 
encoded proteins would be ideal targets for anticancer therapies, as 
demonstrated by the clinical success of new therapies against ampli- 
fied oncogenes, such as ERBB2 and EGFR (7, 8), in breast cancer and 
other solid tumors. Besides amplifications of known oncogenes, over 
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A % 




Expression ratio 



Fig. 1. Impact of gene copy number on global gene expression levels. A, percentage of 
over- and undercx pressed genes (Y axis) according to copy number ratios (X axis). 
Threshold values used for over- and undercxpression were >2.I84 (global upper 7% of 
the cDNA ratios) and <0.4B26 (global lower 7% of the expression ratios). B, percentage 
of amplified and deleted genes according to expression ratios. Threshold values for 
amplification and deletion were >K5 and <0.7. 



20 recurrent regions of DNA amplification have been mapped in 
breast cancer by CGH 5 (9, 10). However, these amplicons are often 
large and poorly defined, and their impact on gene expression remains 
unknown. 

We hypothesized that genome-wide identification of those gene 
expression changes that are attributable to underlying gene copy 
number alterations would highlight transcripts that are actively in- 
volved in the causation or maintenance of the malignant phenotype. 
To identify such transcripts, we applied a combination of cDNA and 
CGH microarrays to: (a) determine the global impact that gene copy 
number variation plays in breast cancer development and progression; 
and (b) identify and characterize those genes whose mRNA expres- 



5 The abbreviations used are: CGH, comparative genomic hybridization; FISH, fluo- 
rescence in situ hybridization; RT-PCR, reverse transcriptio n-PCR_ 
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Fig. 2. Genome-wide copy number and expression analysis in the MCF-7 breast cancer cell line. A t chromosomal CGH analysis of MCF-7. The copy number ratio profile (blue 
tine) across the entire genome from lp telomere to Xq telomere is shown along with :£ 1 SD {orange fines). The black horizontal line indicates a ratio of 1-0; red line, a ratio of 0,8; 
and green line, a ratio of \2. B-C, genome-wide copy number analysis in MCF-7 by CGH on cDNA microarray. The copy number ratios were plotted as a function of the position 
of the cDNA clones along the human genome. In B> individual data points arc connected with a line, and a moving median of 10 adjacent clones is shown. Red horizontal line, the 
copy number ratio of 1 .0. In C f individual data points arc labeled by color coding according to cDNA expression ratios. The bright red dots indicate the upper 2%, and dark red rfo/j, 
the next 5% of the expression ratios in MCF-7 cells (overcxpressed genes); bright green dots indicate the lowest 2%, and dark green dots, the next 5% of the expression ratios 
(undercxpressed genes); the rest of the observations arc shown with black crosses. The chromosome numbers arc shown at the bottom of the figure, and chromosome boundaries arc 
indicated with a dashed line. 



sion is most significantly associated with amplification of the corre- 
sponding genomic template. 

MATERIALS AND METHODS 

Breast Cancer Cell Lines* Fourteen breast cancer cell lines (BT-20, BT- 
474, HCC1 428, Hs578t, MCF7, MDA-361, MDA-436, MDA-453, MDA-468, 
SKBR-3, T-47D, UACC812, ZR-75-1, and ZR-75-30) were obtained from the 
American Type Culture Collection (Manassas, VA). Cells were grown under 
recommended culture conditions. Genomic DNA and rnRNA were isolated 
using standard protocols. 

Copy Number and Expression Analyses by cDNA Microarrays. The 
preparation and printing of the 13,824 cDNA clones on glass slides were 
performed as described (11-13). Of these clones, 244 represented uncharac- 
terized expressed sequence tags, and the remainder corresponded to known 
genes. CGH experiments on cDNA microarrays were done as described (14, 
15). Briefly, 20 of genomic DNA from breast cancer cell lines and normal 
human WBCs were digested for 14-18 h with Ahtl and Rsal (Life Technol- 
ogies, Inc., Rockville, MD) and purified by phenol/chloroform extraction. Six 
fig of digested cell line DNAs were labeled with Cy3-dUTP (Amersham 
Pharmacia) and normal DNA with Cy5-dUTP (Amersham Pharmacia) using 
the Bioprime Labeling kit (Life Technologies, Inc.). Hybridization (14, 15) and 
posthybridization washes (13) were done as described. For the expression 
analyses, a standard reference (Universal Human Reference RNA; Stratagene, 
La Jolla, CA) was used in all experiments. Forty u.g of reference RNA were 
labeled with Cy3-dUTP and 3.5 peg of test rnRNA with Cy5-dUTP, and the 
labeled cDNAs were hybridized on microarrays as described (13,15). For both 
microarray analyses, a laser confocal scanner (Agilent Technologies, Palo 
Alto, CA) was used to measure the fluorescence intensities at the target 
locations using the DE ARRAY software (16). After background subtraction, 
average intensities at each clone in the test hybridization were divided by the 
average intensity of the corresponding clone in the control hybridization. For 
the copy number analysis, the ratios were normalized on the basis of the 
distribution of ratios of all targets on the array and for the expression analysis 
on the basis of 88 housekeeping genes, whicb were sported four times onto the 
array. Low quality measurements {i.e„ copy number data with mean reference 
intensity <JO0 fluorescent units, and expression data with both lest and 
reference intensity <100 fluorescent units and/or with spot size <50 units) 
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were excluded from the analysis and were treated as missing values. The 
distributions of fluorescence ratios were used to define outpoints for increased/ 
decreased copy number. Genes with CGH ratio >1.43 (representing the upper 
5% of the CGH ratios across all experiments) were considered to be amplified, 
and genes with ratio <0.73 (representing the lower 5%) were considered to be 
deleted. 

Statistical Analysis of CGH and cDNA Microarray Data. To evaluate 
the influence of copy number alterations on gene expression, we applied the 
following statistical approach. CGH and cDNA calibrated intensity ratios were 
log-transformed and normalized using median centering of the values in each 
cell line. Furthermore, cDNA ratios for each gene across all 14 cell lines were 
median centered. For each gene, the CGH data were represented by a vector 
that was labeled 1 for amplification (ratio, > 1 .43) and 0 for no amplification. 
Amplification was correlated with gene expression using the signal-to-noise 
statistics (1). We calculated a weight, yvgy for each gene as follows: 

m B i ~ m go 

where m gXt <r sl and cr^g denote the means and SDs for the expression 
levels for amplified and nonampHfied cell lines, respectively. To assess the 
statistical significance of each weight, we performed 10,000 random permu- 
tations of the label vector. The probability that a gene had a larger or equal 
weight by random permutation than the original weight was denoted by tx. A 
low a (<0.05) indicates a strong association between gene expression and 
amplification. 

Genomic Localization of cDNA Clones and Amplicon Mapping. Each 
cDNA clone on the microarray was assigned to a Unigene cluster using the 
Unigene Build 141* A database of genomic sequence alignment information 
for rnRNA sequences was created from the August 2001 freeze of the Uni- 
versity of California Santa Cruz's GoldenPath database. 7 The chromosome and 
bp positions for each cDNA clone were then retrieved by relating these data 
sets. Amplicons were defined as a CGH copy number ratio >2.0 in at least two 
adjacent clones in two or more cell lines or a CGH ratio >2.0 in at least three 
adjacent clones in a single cell line. The amplicon start and end positions were 



Internet* address: http^/rcsearch.nhgrin ih.gov/mi 
Internet address; www.gcnome.ucsc.edu. 
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Table 1 Summary of independent amplicons in 14 breast cancer celt lines by 
CGH microarray 



Location 


Start (Mb) 


End (Mb) 


Size (Mb) 


1 pl3 


132.79 


132.94 


0.2 


Iq2I 


173.92 


177.25 


3.3 


lq22 


179.28 


179.57 


0.3 


3pi4 


71.94 


74.66 


2.7 


7pI2.l-7plt.2 


55.62 


60.95 


5.3 


7<{3[ 


125.73 


130.96 


5.2 


7<q32 


140.0] 


140.68 


0.7 


8q21.1 1— 8q21 .13 


86.45 


92.46 


6.0 


3q21.3 


98.45 


103.05 


4.6 


8q23.3-Sq24J4 


129.88 


142.15 


12.3 


Sq24.22 


151.21 


152.16 


1.0 


9pl3 


38.65 


39.25 


0.6 


1 3q22-q3 J 


77.15 


81.38 


4.2 


16q22 


86.70 


87.62 


0.9 


I7ql 1 




JU.OJ 


1 .0 


I7ql2-q21.2 


39.79 


42,80 


3.0 


I7q2132-q21.33 


52.47 


55.80 


3.3 


1 7q22~nq23.3 


63.81 


69.70 


5.9 


17q23.3-q24.3 


69.93 


74.99 


5.1 


I9ql3 


40.63 


41.40 


0.8 


20ql].22 


34.59 


35.85 


1.3 


20qJ3.J2 


44.00 


45.62 


1.6 


20ql3,12-^J3.i3 


46.45 


49.43 


3.0 


20ql3.2-qI3.32 


5132 


59.12 


7.8 



CGH were validated, with lq21, 17ql2-q21.2, 17q22-q23, 20ql3.1, 
and 20ql3.2 regions being most commonly amplified. Furthermore, 
the boundaries of these amplicons were precisely delineated. In ad- 
dition, novel amplicons were identified at 9pl3 (38.65-39,25 Mb), 
and 17q21.3 (52.47-55.80 Mb). 

Direct Identification of Putative Amplification Target Genes. 
The cDNA/CGH microarray technique enables the direct correla- 
tion of copy number and expression data on a gene-by-gene basis 
throughout the genome. We directly annotated high-resolution 
CGH plots with gene expression data using color coding. Fig. 2C 
shows that most of the amplified genes in the MCF-7 breast cancer 
cell line at lpl3, 17q22-q23, and 20ql3 were highly overex- 
pressed. A view of chromosome 7 in the MDA-468 ceil line 
implicates EGFR as the most highly overexpressed and amplified 
gene at 7pl 1— pl2 (Fig. ZA). In BT-474, the two known amplicons 
at 17ql2 and 17q22-q23 contained numerous highly overex- 
pressed genes (Fig. 35). In addition several genes, including the 
homeobox genes HOXB2 and HOXB 7 a were highly amplified in a 
previously undescribed independent amplicon at 17q21.3. HOXB7 
was systematically amplified (as validated by FISH, Fig. 35, inset) 
as well as overexpressed (as verified by RT-PCR, data not shown) 
in BT-474, UACC812, and ZR-75-30 cells. Furthermore, this novel 



extended to include neighboring nonaniplified clones (ratio, <1.5). The am- 
plicon size determination was partially dependent on local clone density. 

FISH. Dual-color interphase FISH to breast cancer cell lines was done as 
described (17). Bacterial artificial cr^ompsojne clone RPH-361K8 was la- 
beled with SpectrumOrange (Vysis, Downers Grove, IL), and Spectrum- 
Orange-labeled probe for EGFR was obtained from Vysis. SpectrumGreen- 
labeled chromosome 7 and 17 centromere probes (Vysis) were used as a 
reference. A tissue microarray containing 612 formalin-fixed, paraffin-embed- 
ded primary breast cancers (17) was applied in FISH analyses as described 
(18). The use of these specimens was approved by the Ethics Committee of the 
University of Basel and by the NIH. Specimens containing a 2-fold or higher 
increase in the number of test probe signals, as compared with corresponding 
centromere signals, in at least 1 0% of the tumor cells were considered to be 
amplified. Survival analysis was performed using the Kaplan-Meier method 
and the log-rank test. 

RT-PCR. The HOXB7 expression level was determined relative to 
GAPDH* Reverse transcription and PCR amplification were performed using 
Access RT-PCR System (Promega Corp., Madison, WI) with 10 ng of mRNA 
as a template. HOXB7 primers were 5*-GAGCAGAGGG ACTCGG ACTT-3 ' 
and 5 '-GCGTC AGGTAGCG ATTGTAG-3 ' . 

RESULTS 

Global Effect of Copy Number on Gene Expression. 13,824 
arrayed cDNA clones were applied for analysis of gene expression 
and gene copy number (CGH microarrays) in 14 breast cancer cell 
lines. The results illustrate a considerable influence of copy number 
on gene expression patterns. Up to 44% of the highly amplified 
transcripts (CGH ratio, >2.5) were overexpressed (i.e., belonged to 
the global upper 7% of expression ratios), compared with only 6% for 
genes with normal copy number levels (Fig. 1 A). Conversely, 10.5% 
of the transcripts with high-level expression (cDNA ratio, >10) 
showed increased copy number (Fig. \B). Low-level copy number 
increases and decreases were also associated with similar, although 
less dramatic, outcomes on gene expression (Fig. 1). 

Identification of Distinct Breast Cancer Amplicons. Base-pair 
locations obtained for 1 1,994 cDNAs (86.8%) were used to plot copy 
number changes as a function of genomic position (Fig. 2, Supple- 
ment Fig. A). The average spacing of clones throughout the genome 
was 267 kb. This high-resolution mapping identified 24 independent 
breast cancer amplicons, spanning from 0.2 to 12 Mb of DNA (Table 
I). Several amplification sites detected previously by chromosomal 
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Fig. 3. Annotation of gene expression data on CGH microarray profiles. A, genes in the 
7pl J -pi 2 amplicon in the MDA-468 cell line arc highly expressed (red dots) and include 
the EGFR oncogene. B, several genes in the 17ql2, 17q21.3, and 17q23 amplicons in the 
BT-474 breast cancer cell line are highly overexpressed (red) and include the HOXB7 
gene. The data labels and color coding are as indicated for Fig. 2C. Insets show 
chromosomal CGH profiles for the corresponding chromosomes and validation of the 
increased copy number by interphase FISH using EGFR (red) and chromosome 7 
centromere probe (green) to MDA-468 (A) and HOXB7-spzcific probe (red) and chro- 
mosome 17 centromere (green) to BT-474 cells (B). 
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Copy number ratto 



Expression ratio 



Fig. 4. List of 50 genes with a statistically 
significant correlation (a value <0.05) between 
gene copy number and gene expression. Name, 
chromosomal location, and the a value for each 
gene are indicated. The genes have been ordered 
according to their position in the genome. The color 
maps on the righr illustrate the copy number and 
expression ratio patterns in the 14 cell lines. The 
key to the color code is shown at the bottom of the 
graph. Gray squares, missing values. The complete 
list of 270 genes is shown in supplemental Fig. B. 
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amplification was validated to be present in 10.2% of 363 primary 
breast cancers by FISH to a tissue microarray and was associated 
with poor prognosis of the patients (P = 0.001). 

Statistical Identification and Characterization of 270 Highly 
Expressed Genes in Amplicons. Statistical comparison of expres- 
sion levels of all genes as a function of gene amplification identified 
270 genes whose expression was significantly influenced by copy 
number across all 14 cell lines (Fig. 4, Supplemental Fig. B). Accord- 
ing to the gene ontology data, 8 91 of the 270 genes represented 
hypothetical proteins or genes with no functional annotation, whereas 
179 had associated functional information available. Of these, 151 
(84%) are implicated in apoptosis, cell proliferation, signal transduc- 
tion, and transcription, whereas 28 (16%) had functional annotations 
that could not be directly linked with cancer. 



DISCUSSION 

The importance of recurrent gene and chromosome copy number 
changes in the development and progression of solid tumors has been 
characterized in > 1000 publications applying CGH 9 (9, 10), as well 
as in a large number of other molecular cytogenetic, cytogenetic, and 
molecular genetic studies. The effects of these somatic genetic 
changes on gene expression levels have remained largely unknown, 
although a few studies have explored gene expression changes occur- 
ring in specific amplicons (15, 19—21). Here, we applied genome- 
wide cDNA micro arrays to identify transcripts whose expression 
changes were attributable to underlying gene copy number alterations 
in breast cancer. 

The overall impact of copy number on gene expression patterns was 
substantial with the most dramatic effects seen in the case of high- 



8 Internet address: http://www.gcneonlology.org/. 



9 Internet address: hu^^/www.ncbLnlm.nib-gov/entrez. 
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level copy number increase. Low-level copy number gains and losses 
also had a significant influence on expression levels of genes in the 
regions affected, but these effects were more subtle on a gene-by-gene 
basis than those of high-level amplifications. However, the impact of 
low-level gains on the dysregulation of gene expression patterns in 
cancer may be equally important if not more important than that of 
high-level amplifications. Aneuploidy and low-level gains and losses 
of chromosomal arms represent the most common types of genetic 
alterations in breast and other cancers and, therefore, have an influ- 
ence on many genes. Our results in breast cancer extend the recent 
studies on the impact of aneuploidy on global gene expression pat- 
terns in yeast cells, acute myeloid leukemia, and a prostate cancer 
model system (22-24). 

The CGH microarray analysis identified 24 independent breast 
cancer ampl icons. We defined the precise boundaries for many am- 
plicons detected previously by chromosomal CGH (9, 10, 25, 26) and 
also discovered novel amplicons that had not been detected previ- 
ously, presumably because of their small size (only 1-2 Mb) or close 
proximity to other larger amplicons. One of these novel amplicons 
involved the homeobox gene region at 17q21 .3 and led to the over- 
expression of the HOXB7 and HOXB2 genes. The homeodomain 
transcription factors are known to be key regulators of embryonic 
development and have been occasionally reported to undergo aberrant 
expression in cancer (27, 28). HOXB7 transfection induced cell pro- 
liferation in melanoma, breast, and ovarian cancer cells and increased 
tumorigenicity and angiogenesis in breast cancer (29-32), The pres- 
ent results imply that gene amplification may be a prominent mech- 
anism for overexpressing HOXB7 in breast cancer and suggest that 
HOXB7 contributes to tumor progression and confers an aggressive 
disease phenotype in breast cancer. This view is supported by our 
finding of amplification of HOXB7 in 10% of 363 primary breast 
cancers, as well as an association of amplification with poor prognosis 
of the patients. 

We carried out a systematic search to identify genes whose 
expression levels across all 14 cell lines were attributable to 
amplification status. Statistical analysis revealed 270 such genes 
(representing —2% of all genes on the array), including not only 
previously described amplified genes, such as HER-2, MYC, 
EGFR, ribosomal protein s6 kinase, and AIB3 t but also numerous 
novel genes such as NRAS-related gene (Ipl 3), syndecan-2 (8q22), 
and bone morphogenic protein (20ql3.1), whose activation by 
amplification may similarly promote breast cancer progression. 
Most of the 270 genes have not been implicated previously in 
breast cancer development and suggest novel pathogenetic mech- 
anisms. Although we would not expect all of them to be causally 
involved, it is intriguing that 84% of the genes with associated 
functional information were implicated in apoptosis, cell prolifer- 
ation, signal transduction, transcription, or other cellular processes 
that could directly imply a possible role in cancer progression. 
Therefore, a detailed characterization of these genes may provide 
biological insights to breast cancer progression and might lead to 
the development of novel therapeutic strategies. 

In summary, we. demonstrate application of cDNA microarrays 
to the analysis of both copy number and expression levels of over 
12,000 transcripts throughout the breast cancer genome, roughly 
once every 267 kb. This analysis provided: (a) evidence of a 
prominent global influence of copy number changes on gene 
expression levels; (b) a high- resolution map of 24 independent 
amplicons in breast cancer; and (c) identification of a set of 270 
genes, the overexpression of which was statistically attributable to 
gene amplification. Characterization of a novel amplicon at 
17q2L3 implicated amplification and overexpression of the 
HOXB7 gene in breast cancer, including a clinical association 



between HOXB7 amplification and poor patient prognosis. Overall, 
our results illustrate how the identification of genes activated by 
gene amplification provides a powerful approach to highlight 
genes with an important role in cancer as well as to prioritize and 
validate putative targets for therapy development. 

REFERENCES 

1. Gotub, T. R., Slontro, D. K., Tamayo, P., Huard, C, Gaasenbcck, M, Mcsirov, J, P., 
Coller.'H., Loh, M. L. t Downing, J. Caligiuri, M. A., Bloornfield. C. D., and 
Lander, £. S. Molecular classification of cancer class discovery and class prediction 
by gene expression monitoring. Science (Wash. DC), 286: 531-537, 1999. 

2. Alizadeh, A. A., Eiscn, M. B., Davis, R. E., Ma, G, Lossos, I. S„ Rosen wald, A., 
Boldrick J. C, Sabet, H., Tran, T., Yu, X., et al Distinct types of diffuse large B-ccll 
lymphoma identified by gene expression profiling. "Nature (Lond), 403: 503-51], 
2000, 

3. Bittncr, M.„ Meltzcr, P.. Chen, Y„ Jiang, Y., Scftor, E., Hendrix, M., Radmachcr, M., 
Simon, R., Yakhini, Z., Ben-Dor, A., et al Molecular classification of cutaneous 
malignant melanoma by gene expression profiling, Nature (Lond.), 406: 536—540, 
2000, 

4. Perou, C. M., Sorlie, T., Eiseu, M. B., van de Rijn, M., Jeffrey, S. S., Rees, C A., 
Pollack, J. R., Ross, D. T.» Johnsen, 11, Akslen, L. A. T et at Molecular portraits of 
human breast tumours. Nature (Lond.), 406: 747-752, 2000. 

5. Dhanasekaran, S. M., Barrcttc, T. R., Ghosh, D., Shah, R., Varambally, S., Kurachi, 
K., Pienta, K. J., Rubin, M. A., and Chinnaiyao, A. M. Delineation of prognostic 
bioraarkers in prostate cancer. Nature (Lond.), 412: 822-826. 2001. 

6. Sorlie, T„ Perou, C. M., Tibshirani, R., Aas, T„ Geislcr, S., Johnsen, H., Hastie, T\, 
Eiscn, M. B., van de Rijn, M., Jeffrey, S. S., et al Gene expression patterns of breast 
carcinomas distinguish tumor subclasses with clinical implications. Proc. Natl. Acad, 
Sci. USA, 98; 1Q869-10874, 2001. 

7. Ross, J. S-, end Ipetcher, J. A. The HER-2/neu oncogene: prognostic factor, predictive 
factor and target for therapy. Semin. Cancer Biol,, 9: 125-138, 1999. 

8. Arteaga. C. L. The epidermal growth factor receptor: from mutant oncogene in 
nouhuman cancers to therapeutic target in human neoplasia. J. Clin. Oncol., 19; 
32-40, 2001. 

9. Knuutila, S., Bjorkqvist, A, M., Autio, K., Tarkkanen, M„ Wolf, M., Mourn. 
Szymanska, J., Larramendy, M. L., Tapper, J„ Pere, H„ El-Rifai, W., et al DNA copy 
number amplifications in human neoplasms: review of comparative genomic hybrid- 
ization studies. Am. J. Pathol., 152: 1 107-1 123, 1998, 

10. Knuutila S., Autio K., and Aalto Y. Online access to CGH data of DNA sequence 
copy number changes. Am. J. Pathol., 157: 689, 2000. 

1 1 . DeRisi, J., Penland, L., Brown, P. O., Bittoer, M. Meltzcr, P. S. ? Ray, M., Chen, 
Y., Su, Y, A., and Trent, J. M. Use of a cDNA microarray to analyse gene expression 
patterns in human cancer. Nat GeneL, 14: 457-460, 1996. 

12. Shalon, D., Smith, S. J., and Brown, P. O. A DNA microarray system for analyzing 
complex DNA samples using two-color fluorescent probe bybridization. Genome 
Res., 6: 639-645, 1996. 

13. Mousses, S.„ Bittncr, M. I_, Chen, Y., Dougherty, E. R-, Baxevanis, A., Meltzer, P. S., 
and Trent, J. M. Gene expression analysis by cDNA microarrays. In: F. J. Uvesey and 
S.P. Hunt (eds.), Functional Genomics, pp. 113-137. Oxford: Oxford University 
Press. 2000. 

14. Pollack, J. R., Perou, C. M., Atizadeh, A. A., Eiscn, M. B., Pergamcnschikov, A., 
Williams, C F., Jeffrey, S. S„ Botstein, D., and Brown, P. O. Genome-wide analysis 
of DNA copy-number changes using cDNA microarrays. Nat Genet, 23: 41-46. 
1999. 

15. Monni, O., Barlund, M., Mousses, S., Kononen, J., Sautcr, G., Hciskanen, M., 
Paavola, P., Avela, JC. Chen, Y., Bitmer, M. L, and Kallionicmi, A. Comprehensive 
copy number and gene expression profiling of the 17q23 amplicon in human breast 
cancer. Proc. Natl. Acad. Sci. USA, 98: 5711-5716, 2001. 

16. Chen, Y., Dougherty, E. R M and Bittncr, M. L. Ratio-based decisions and the 
quantitative analysis of cDNA microarray images. J. Biomcd. Optics, 2: 364-374, 
1997. 

17. Barlund, M., Forozan, F., Kononen, J., Bubcndorf, L„ Chen, Y., Bittncr, M. L., 
Torhorst J., Haas, P., Bucher, C, Sautcr, G., et al. Detecting activation of ribosomal 
protein S6 kinase by complementary DNA and tissue microarray analysis. J. Natl. 
Cancer Inst. 92: 1252-1259, 2000. 

18. Andersen, C. L., Hostctter, G., Grigoryan, A., Sautcr, G, and Kallionicmi, A. 
Improved procedure for fluorescence in situ hybridization on tissue microarrays. 
Cytometry. 45: 83-86, 200 1 . 

19. Kauraniemi, P., BSrlund. M., Monni, 0., and Kallionicmi, A, New amplified and 
highly expressed genes discovered in the ERBB2 amplicon in breast cancer by cDNA 
microarrays. Cancer Res., 61: 8235-8240, 2001. 

20. Clark, J., Edwards, John, M., Flohi, P„ Gordon, T. t Maillard, K., Giddings, I., 
Brown, C, Bagherzadeh, A„ Campbell, C, Shipley, J., Wooster, R., and Cooper! 
C 5. Identification of amplified and expressed genes in breast cancer by comparative 
hybridization onto microarrays of randomly selected cDNA clones. Genes Chromo- 
somes Cancer, 34: 104-1 14, 2002, 

21. Varis, A„ Wolf, M., Monni, 0„ Vakkari, M. L, Kokkola, A., Mosfcaluk, C, Frierson, 
H., Powell, S. M., Knuutila, S., KalHonicrni, A., and El-Rifai, W. Targets of gene 
amplification and overexpression at 1 7q in gastric cancer. Cancer Res 62 • 2625- 
2629, 2002. 

22. Hughes, T. R_, Roberts, C. J., Dai, H„ Jones, A. R., Meyer, M. R„ Slade, D., 
Burchard, J^ Dow, S„ Ward, T, R., Kido% M, J^ Friend, S. H., and Marlon M. S. 



GENE EXPRESSION PATTERNS IN BREAST CANCER 



Widespread ancuploidy revealed by DNA microarray expression profiling. Nat. 
Genet., 25: 333-337, 2000. 

23. Vinaneva, K., Wright, F. A., Tanner, S. M., Yuan, B., Lemon, W, J., Caligiuri, M. A., 
Bloomficld, C D., dc La Chapclle, A., and Krahe, R. Expression profiling reveals 
fundamental biological differences in acute myeloid leukemia with isolated trisomy 8 
and normal cytogenetics. Proc. Natl. Acad. Sci. USA, 98: J 124— J 129, 2001. 

24. Phillips, J. L., Hayward, S. W., Wang. Y., Vasselli, J„ Pavlovich, C, PadilU-Nash, 
H„ Pczullo, J. R., Ghadimi, B. M., Grossfeld, G. Rivera, A., Liochan, W. M., 
Cunha, G. R., and Ricd. T. The consequences of chromosomal ancuploidy on gene 
expression profiles in a cell line model for prostate carcinogenesis. Cancer Res., 67: 

. 8143-8149,2001. 

25. Barlund, M n Tirkkonen. M., Forozan, F., Tanner, M. M., Kallioniemi, 0. P., and 
Kallioniemi, A. Increased copy number at 1 7q22- q24 by CGH in breast cancer is due 
to high.- level amplification of two separate regions. Genes Chromosomes Cancer, 20: 
372-376, 1997. 

26. Tanner, M. M„ Tirkkonen, M., Kallioniemi, A, Isola, J., Kuukasjarvi, T, Collins, C, 
Kowbel, D. ( Guan, X. Y,. Trenx, I., Gray, J. W.. Mcltzer, P., and Kallioniemi O, P. 
Independent amplification and frequent co-amplification of three nonsyntenic regions 



on the long arm of chromosome 20 in human breast cancer. Cancer Res., $6: 
3441-3445. 1996. 

27. Cillo, C, Faiella, A., Cantilc, M„ and BoncinclU, E. Homeobox genes and cancer 
Exp. Cell Res, 248: 1-$, 1999. 

28. Cillo, C, Cantile, M., Faiclla, A„ and Boncinelli, E. Homeobox genes in normal and 
malignant cells. J. Cell. Physiol., J88: 161-169, 2001. 

29. Care, A., Silvani, A., Meccia, E, Mattia, G., Stoppacciaro, A, Parmiani, G., Peschlc, 
C, and Colombo, M. P. HOXB7 constiturjvely activates basic fibroblast growth 
factor in melanomas. Mol. Cell, Biol., 16: 4842-4851, 1996. 

30. Care, A, Silvani, A., Mcccia, E., Mattia, G.„ Peschle, C, and Colombo, M. P. 
Transduction of the SkBr3 breast carcinoma celt line with the HOXB7 gene induces 
bFGF expression, increases cell proliferation and reduces growth factor dependence. 
Oncogene, 16: 3285-3289, 3998. 

31. Care, A., Felicctti, F„ Mcccia, E., Bottero, L., Parenza, M„ Stoppacciaro, A., Peschle, 
C, and Colombo, M, P. HOXB7: a key factor for tumor-associated angiogenic switch. 
Cancer Res., 61; 6532-6539, 2001. 

32. Naora, H.. Yang, Y. Q„ Montz, F. L, Seidman, J. D., Kurman, R. J., and Roden, R. B. 
A serologically identified tumor antigen encoded by a homeobox gene promotes 
growth of ovarian epithelial cells. Proc. NatL Acad Sci. USA, 98: 4060-4065, 2001. 



6245 



Microarray analysis reveals a major direct role of 
DNA copy number alteration in the transcriptional 
program of human breast tumors 

Jonathan R. Pollack***, Therese Sortie 5 , Charles M, Perou', Christian A. Rees"**, Stefanie S, Jeffrey", Per L Lonning**, 
Robert Tibshirani 55 , David Botstein", Anne-Use Barresen-Dale 6 , and Patrick O. Brown 1 ™ 

Departments of * Pathology, ^Genetics, "Surgery, "Hearth Research and Policy, and "Biochemistry, and f Howard Hughes Medical Institute, Stanford 
University School of Medicine, Stanford, CA 94305; department of Genetics, Norwegian Radium Hospital, Montebello, N-0310 Oslo, Norway; 
**Department of Medicine (Oncology), Haukeland University Hospital, N-5021 Bergen, Norway; and ^Department of Genetics and Uneberger 
Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC 27599 



Contributed by Patrick O. Brown, August 6, 2002 

Genomic DNA copy number alterations are key genetic events in 
the development and progression of human cancers. Here we 
report a genome-wide microarray comparative genomic hybrid- 
ization (array CGH) analysis of DNA copy number variation in 
a series of primary human breast tumors. We have profiled DNA 
copy number alteration across 6,691 mapped human genes, in 44 
predominantly advanced, primary breast tumors and 10 breast 
cancer cell lines. White the overall patterns of DNA amplification 
and deletion corroborate previous cytogenetic studies, the high- 
resolution (gene-by-gene) mapping of amplicon boundaries and 
the quantitative analysis of amplicon shape provide significant 
improvement in the localization of candidate oncogenes- Parallel 
microarray measurements of mRNA levels reveal the remarkable 
degree to which variation in gene copy number contributes to 
variation in gene expression in tumor cells. Specifically, we find 
that 62% of highly amplified genes show moderately or highly 
elevated expression, that DNA copy number influences gene ex- 
pression across a wide range of DNA copy number alterations 
(deletion, low-, mid- and high-level amplification), that on average, 
a 2-fold change in DNA copy number is associated with a corre- 
sponding 1.5-fold change in mRNA levels, and that overall, at least 
12% of all the variation in gene expression among the breast 
tumors is directly attributable to underlying variation in gene copy 
number. These findings provide evidence that widespread DNA 
copy number alteration can lead directly to global deregulation of 
gene expression, which may contribute to the development or 
progression of cancer. 

Conventional cytogenetic techniques, including comparative 
genomic hybridization (CGH) (l) t have led to the identifi- 
cation of a number of recurrent regions of DNA copy number 
alteration in breast cancer cell lines and tumors (2-4). While 
some of these regions contain known or candidate oncogenes 
[eg., FGFR1 (8pll), MYC (8q24), CCND1 (llql3), ERBB2 
(17ql2), and ZNF217 (20ql3)j and tumor suppressor genes 
[RBI (13ql4) and TP53 (17pl3)]> the relevant gene(s) within 
other regions (e.g., gain of lq, 8q22, and 17q22-24, and loss of 
Sp) remain to be identified. A high-resolution genome-wide 
map, delineating the boundaries of DNA copy number alter- 
ations in tumors, should facilitate the localization and identifi- 
cation of oncogenes and tumor suppressor genes in breast 
cancer. In this study, we have created such a map, using 
array-based CGH (5-7) to profile DNA copy, number alteration 
in a series of breast cancer cell lines and primary tumors. 

An unresolved question is the extent to which the widespread 
DNA copy number changes that we and others have identified 
in breast tumors alter expression of genes within involved 
regions. Because we had measured mRNA levels in parallel in 
the same samples (8), using the same DNA microarrays, we had 
an opportunity to explore on a genomic scale the relationship 
between DNA copy number changes and gene expression. From 

www.pnas.org/cgi/dol/ 10. 1073/pnas. 16247 1 999 



this analysis, we have identified a significant impact of wide- 
spread DNA copy number alteration on the transcriptional 
programs of breast tumors. 

Materials and Methods 

Tumors and Cell Lines. Primary breast tumors were predominantly 
large (>3 cm), intermediate-grade, infiltrating ductal carcino- 
mas, with more than 50% being lymph node positive. The 
fraction of tumor cells within specimens averaged at least 50%. 
Details of individual tumors have been published (8, 9), and 
are summarized in Table 1, which is published as supporting 
information on the PNAS web site, www.pnas.org. Breast cancer 
cell lines were obtained from the American Type Culture 
Collection. Genomic DNA was isolated either using Qiagen 
genomic DNA columns, or by phenol/chloroform extraction 
followed by ethanol precipitation. 

DNA Labeling and Microarray Hybridizations. Genomic DNA label- 
ing and hybridizations were performed essentially as described 
in Pollack et aL (7), with slight modifications. Two micrograms 
of DNA was labeled in a total volume of 50 microliters and the 
volumes of all reagents were adjusted accordingly. "Test" DNA 
(from tumors and cell lines) was f luorescently labeled (Cy5) and 
hybridized to a human cDNA microarray containing 6,691 
different mapped human genes (i.e., UniGene clusters). The 
"reference" (labeled with Cy3) for each hybridization was nor- 
mal female leukocyte DNA from a single donor. The fabrication 
of cDNA microarrays and the labeling and hybridization of 
mRNA samples have been described (8). 

Data Analysis and Map Positions. Hybridized arrays were scanned 
on a GenePix scanner (Axon Instruments, Foster City, CA), and 
fluorescence ratios (test/reference) calculated using scanalyze 
software (available at http://rana.lbl.gov). Fluorescence ratios 
were normalized for each array by setting the average log 
fluorescence ratio for all array elements equal to 0, Measure- 
ments with fluorescence intensities more than 20% above back- 
ground were considered reliable. DNA copy number profiles 
that deviated significantly from background ratios measured in 
normal genomic DNA control hybridizations were interpreted as 
evidence of real DNA copy number alteration (see Estimating 
Significance of Altered Fluorescence Ratios in the supporting 
information). When indicated, DNA copy number profiles are 
displayed as a moving average (symmetric 5-nearest neighbors). 
Map positions for arrayed human cDNAs were assigned by 
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Fig. 1. Genome-wide measurement of DNA copy number alteration by array CGH. <a) DNA copy number profiles are illustrated for cell lines containing different 
numbers of X chromosomes, for breast cancer cell lines, and for breast tumors. Each row represents a different cell line or tumor, and each column represents 
one of 6,691 different mapped human genes present on the microarray, ordered by genome map position from Ipterthrough Xqter. Moving average (symmetric 
5-nearest neighbors) fluorescence ratios (test/reference) are depicted using a logz-based pseudocolor scale (indicated), such that red luminescence reflects 
fold-amplification, green luminescence reflects fold-deletion, and black indicates no change (gray indicates poorly measured data). (6) Enlarged view of DNA 
copy number profiles across the X chromosome, shown for cell lines containing different numbers of X chromosomes. 



identifying the starting position of the best and longest match of 
any DNA sequence represented in the corresponding UniGene 
cluster (10) against the "Golden Path*' genome assembly 
(http://genome.ucsc.edu/; Oct 7, 2000 Freeze). For UniGene 
clusters represented by multiple arrayed elements, mean fluo- 
rescence ratios (for all elements representing the same UniGene 
cluster) are reported. For mRNA measurements, fluorescence 
ratios are "mean-centered" (i.e., reported relative to the mean 
ratio across the 44 tumor samples). The data set described here 
can be accessed in its entirety in the supporting information. 

Results 

We performed CGH on 44 predominantly locally advanced, 
primary breast tumors and 10 breast cancer cell lines, using 
cDNA microarrays containing 6,691 different mapped human 
genes (Fig. la; also see Materials and Methods for details of 
microarray hybridizations). To take full advantage of the im- 
proved spatial resolution of array CGH, we ordered (fluores- 
cence ratios for) the 6,691 cDNAs according to the "Golden 
Path" (http://genome.ucsc.edu/) genome assembly of the draft 
human genome sequences (11). In so doing, arrayed cDNAs not 
only themselves represent genes of potential interest (e.g., 
candidate oncogenes within amplicons), but also provide precise 
genetic landmarks for chromosomal regions of amplification and 



deletion. Parallel analysis of DNA from cell lines containing 
different numbers of X chromosomes (Fig. lb), as we did before 
(7), demonstrated the sensitivity of our method to detect single- 
copy loss (45, XO), and 1.5- (47,XXX), 2- (48 T XXXX), or 
2.5-fold (49,XXXXX) gains (also see Fig. 5, which is published 
as supporting information on the PNAS web site). Fluorescence 
ratios were linearly proportional to copy number ratios, which 
were slightly underestimated, in agreement with previous ob- 
servations (7). Numerous DNA copy number alterations were 
evident in both the breast cancer cell lines and primary tumors 
(Fig. la), detected in the tumors despite the presence of euploid 
non-tumor cell types; the magnitudes of the observed changes 
were generally lower in the tumor samples. DNA copy-number 
alterations were found in every cancer cell line- and tumor, and 
on every human chromosome in at least one sample. Recurrent 
regions of DNA copy number gain and loss were readily iden- 
tifiable. For example, gains within lq, 8q, 17q, and 20q were 
observed in a high proportion of breast cancer ceU lines/tumors 
(90%/69%, 100%/47%, 100%/60%, and 90%/44%, respective- 
ly), as were losses within lp, 3p, 8p, and 13q (80%/24%, 
80%/22%, 80%/22%, and 70%/18%, respectively), consistent 
with published cytogenetic studies (refe. 2-4; a complete listing 
of gains/losses is provided in Tables 2 and 3, which are published 
as supporting information on the PNAS web site). The total 
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Fig. 2, DNA copy number alteration across chromosome 8 by array CGH. (a) DNAcopy number profiles are illustrated for cell lines containing different numbers 
of X chromosomes, for breast cancer cell lines, and for breast tumors. Breast cancer ceil lines and tumors are separately ordered by hierarchical clustering to 
highlight recurrent copy number changes. The 241 genes present on the microarrays and mapping to chromosome 8 are ordered by position along the 
chromosome. Fluorescence ratios (test/reference) are depicted by a 1092 pseudocolor scale (indicated). Selected genes are indicated with color-coded text (red, 
increased; green, decreased; black, no change; gray, not well measured) to reflect correspondingly altered mRNA levels (observed in the majority of the subset 
of samples displaying the DNA copy number change). The map positions for genes of interest that are not represented on the microarray are indicated in the 
row above those genes represented on the array. (6) Graphical display of DNA copy number profile for breast cancer cell line SK&R3. Fluorescence ratios 
(tumor /normal) are plotted on a log2 scale for chromosome 8 genes, ordered along the chromosome. 



number of genomic alterations (gains and Josses) was found to 
be significantly higher in breast tumors that were high grade (P - 
0.008), consistent with published CGH data (3), estrogen recep- 
tor negative (P = 0.04), and harboring TP53 mutations (P = 
0.0006) (see Table 4, which is published as supporting informa- 
tion on the PNAS web site). 

The improved spatial resolution of our array CGH analysis is 
illustrated for chromosome 8, which displayed extensive DNA 
copy number alteration in our series. A detailed view of the 
variation in the copy number of 241 genes mapping to chromo- 
some 8 revealed multiple regions of recurrent amplification; 
each of these potentially harbors a different known or previously 
uncharacterized oncogene (Fig. 2a). The complexity of amplicon 
structure is most easily appreciated in the breast cancer cell line 
SKBR3. Although a conventional CGH analysis of 8q in SKBR3 
identified only two distinct regions of amplification (12), we 
observed three distinct regions of high-level amplification (la- 
beled 1-3 in Fig. 2b). For each of these regions we can define the 



boundaries of the interval recurrently amplified in the tumors we 
examined; in each case, known or plausible candidate oncogenes 
can be identified (a description of these regions, as well as the 
recurrently amplified regions on chromosomes 17 and 20, can be 
found in Figs. 6 and 7, which are published as supporting 
information on the PNAS web site). 

For a subset of breast cancer cell lines and tumors (4 and 37, 
respectively), and a subset of arrayed genes (6,095), mRNA 
levels were quantitatively measured in parallel by using cDNA 
microarrays (8). The parallel assessment of mRNA levels is 
useful in the interpretation of DNA copy number changes. For 
example, the highly amplified genes that are also highly ex- 
pressed are the strongest candidate oncogenes within an ampli- 
con. Perhaps more significantly, our parallel analysis of DNA 
copy number changes and mRNA levels provides us the oppor- 
tunity to assess the global impact of widespread DNA copy 
number alteration on gene expression in tumor cells. 

A strong influence of DNA copy number on gene expression 
is evident in an examination of the pseudocolor representations 
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Fig. 3. Concordance between DNA copy number and gene expression across chromosome 17. DNA copy number alteration {Upper) and mRNA levels (Lower) 
are illustrated for breast cancer cell lines and tumors. Breast cancer cell lines and tumors are separately ordered by hierarchical clustering (Upper), and the 
identical sample order is maintained {Lower). The 354 genes present on the microarrays and mapping to chromosome 17, and for which both DNA copy number 
and mRNA levels were determined, are ordered by position along the chromosome; selected genes are indicated in color-coded text (see Fig. 2 legend). 
Fluorescence ratios (test/reference) are depicted by separate log* pseudocolor scales (indicated). 



of DNA copy number and mRNA levels for genes on chromo- 
some 17 (Fig. 3). The overall patterns of gene amplification and 
elevated gene expression are quite concordant; i.e., a significant 
fraction of highly amplified genes appear to be correspondingly 
highly expressed. The concordance between high-level amplifi- 
cation and increased gene expression is not restricted to chro- 
mosome 17. Genome-wide 7 of 117 high-level DNA amplifica- 
tions (fluorescence ratios >4, and representing 91 different 
genes), 62% (representing 54 different genes; see Table 5, which 
is published as supporting information on the PNAS web site) 
are found associated with at least moderately elevated mRNA 
levels (mean-centered fluorescence ratios >2), and 42% (rep- 
resenting 36 different genes) are found associated with compa- 
rably highly elevated mRNA levels (mean-centered fluorescence 
ratios >4). 

To determine the extent to which DNA deletion and lower- 
level amplification (in addition to high-level amplification) are 
also associated with corresponding alterations in mRNA levels, 
we performed three separate analyses on the complete data set 
(4 cell lines and 37 tumors, across 6,095 genes). First, we 
determined the average mRNA levels for each of five classes 
of genes, representing DNA deletion, no change, and low-, 
medium-, and high-level amplification (Fig. Aa). For both the 



breast cancer cell lines and tumors, average mRNA levels 
tracked with DNA copy number across all five classes, in a 
statistically significant fashion (P values for pair-wise Student's 
t tests comparing adjacent classes: cell lines, 4 X 10~ 49 , 1 x 10~ 49 , 
5 x 10" 5 , 1 X 1(T 2 ; tumors, 1 X lO" 43 , 1 X 1G~ 214 , 5 X 1{T 4 \ 
1 x 10 ~ 4 ). A linear regression of the average log(DNA copy 
number), for each class, against average log(mRNA level) 
demonstrated that on average, a 2-fold change in DNA copy 
number was accompanied by 1.4- and 1.5-fold changes in mRNA 
level for the breast cancer cell lines and tumors, respectively (Fig. 
4a, regression line not shown). Second, we characterized the 
distribution of the 6,095 correlations between DNA copy num- 
ber and mRNA level, each across the 37 tumor samples (Fig. 46). 
The distribution of correlations forms a normal-shaped curve, 
but with the peak markedly shifted in the positive direction from 
zero. This shift is statistically significant, as evidenced in a plot 
of observed vs. expected correlations (Fig. 4c), and reflects a 
pervasive global influence of DNA copy number alterations on 
gene expression. Notably, the highest correlations between DNA 
copy number and mRNA level (the right tail of the distribution 
in Fig, 4b) comprise both amplified and deleted genes (data not 
shown). Third, we used a linear regression model to estimate the 
fraction of all variation measured in mRNA levels among the 37 
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Fig. A. Genome-wide influence of ON A copy number alterations on mRNA levels, (a) For breast cancer cell lines (gray) and tumor samples (black), both 
mean-centered mRNA fluorescence ratio (log 2 scale) quartiles (box plots indicate 25th, 50th, and 75th percentile) and averages (diamonds; V-value error bars 
indicate standard errors of the mean) are plotted for each of five classes of genes, representing DNA deletion (tumor/normal ratio < 0.8), no change (0.8-1.2), 
low- (1.2-2), medium- (2-4), and high-level (>4) amplification, P values for pair-wise Student's t tests, comparing averages between adjacent classes (moving 
left to right), are 4 X 10" a9 , 1 X 10* w , 5 X 10~ 5 , 1 X 10"* (cell lines), and 1 x 10" 43 , 1 x 10' 2 ' 4 , 5 x 10~ 4 \ 1 x 10~* (tumors), (b) Distribution of correlations between 
DNA copy number and mRNA levels, for 6,095 different human genes across 37 breast tumor samples, (c) Plot ^f observed versus expected correlation coefficients. 
The expected values were obtained by randomization of the sample labels in the DNA copy number data set The line of unity is indicated, (cr) Percent variance 
in gene expression (among tumors) directly explained by variation in gene copy number. Percent variance explained (black line) and fraction of data retained 
(gray line) are plotted for different fluorescence intensity /background (a rough surrogate for signal/noise) cutoff values. Fraction of data retained is relative 
to the 1.2 intensity /background cutoff, Details of the linear regression model used to estimate the fraction of variation in gene expression attributable to 
underlying DNA copy number alteration can be found in the supporting information (see Estimating the Fraction of Variation in Gene Expression Attributable 
to Underlying DNA Copy Number Alteration). 



tumors that could be attributed to underlying variation in DNA 
copy number. From this analysis, we estimate that, overaJJ, about 
1% of all of the observed variation in mRNA levels can be 
explained directly by variation in copy number of the altered 
genes (Fig. Ad), We can reduce the effects of experimental 
measurement error on this estimate by using only that fraction 
of the data most reliably measured (fluorescence intensity/ 
background >3); using that data, our estimate of the percent 
variation in mRNA levels directly attributed to variation in gene 
copy number increases to 12% (Fig. 4d). This still undoubtedly 
represents a significant underestimate, as the observed variation 
in global gene expression is affected not only by true variation in 
the expression programs of the tumor cells themselves, but also 
by the variable presence of non- tumor cell types within clinical 
samples. 

Discussion 

This genome-wide, array CGH analysis of DNA copy number 
alteration in a series of human breast tumors demonstrates the 
usefulness of defining amplicon boundaries at high resolution 
(gene-by-gene), and quantitatively measuring amplicon shape, to 
assist in locating and identifying candidate oncogenes. By ana- 
lyzing mRNA levels in parallel, we have also discovered that 
changes in DNA copy number have a large, pervasive, direct 
effect on global gene expression patterns in both breast cancer 



cell lines and rumors. Although the DNA microarrays used in our 
analysis may display a bias toward characterized and/or highly 
expressed genes, because we are examining such a large fraction 
of the genome (approximately 20% of all human genes), and 
because, as detailed above, we are likely underestimating the 
contribution of DNA copy number changes to altered gene 
expression, we believe our findings are likely to be generalizable 
(but would nevertheless still be remarkable if only applicable to 
this set of -6,100 genes). 

In budding yeast, aneuploidy has been shown to result in 
chromosome-wide gene expression biases (13). Two recent 
studies have begun to examine the global relationship between 
DNA copy number and gene expression in cancer cells. In 
agreement with our findings, Phillips etol. (14) have shown that 
with the acquisition of tumorigenicity in an immortalized pros- 
tate epithelial cell line, new chromosomal gains and losses 
resulted in a statistically significant respective increase and 
decrease in the average expression level of involved genes. In 
contrast, Piatzer et al. (15) recently reported that in metastatic 
colon tumors only —4% of genes within amplified regions were 
found more highly (>2-fold) expressed, when compared with 
normal colonic epithelium. This report differs substantially from 
our finding that 62% of highly amplified genes in breast cancer 
exhibit at least 2-fold increased expression. These contrasting 
findings may reflect methodological differences between the 
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studies. For example, the study of Platzer ei al (15) may have 
systematically under-measured gene expression changes. In this 
regard it is remarkable that only 14 transcripts of many thousand 
residing within unamplified chromosomal regions were found to 
exhibit at least 4-fold altered expression in metastatic colon 
cancer. Additionally, their reliance on lower-resolution chromo- 
somal CGH may have resulted in poorly delimiting the bound- 
aries of high-complexity amplicons, effectively overcalling re- 
gions with amplification. Alternatively, the contrasting findings 
for amplified genes may represent real biological differences 
between breast and metastatic colon tumors; resolution of this 
issue will require further studies. 

Our finding that widespread DNA copy number alteration has 
a large, pervasive and direct effect on global gene expression 
patterns in breast cancer has several important implications. 
First, this finding supports a high degree of copy number- 
dependent gene expression in tumors. Second, it suggests that 
most genes are not subject to specific autoreguiation or dosage 
compensation. Third, this finding cautions that elevated expres- 
sion of an amplified gene cannot alone be considered strong 
independent evidence of a candidate oncogene's role in tumor- 
igenesis. In our study, fully 62% of highly amplified genes 
demonstrated moderately or highly elevated expression. This 
highlights the importance of high-resolution mapping of ampli- 
con boundaries and shape [to identify the "driving" gene(s) 
within amplicons (16)], on a large number of samples, in addition 
to functional studies. Fourth, this finding suggests that analyzing 



the genomic distribution of expressed genes, even within existing 
microarray gene expression data sets, may permit the inference 
of DNA copy number aberration, particularly aneuploidy (where 
gene expression can be averaged across large chromosomal 
regions; see Fig. 3 and supporting information). Fifth, this 
finding implies that a substantial portion of the phenotypic 
uniqueness (and by extension, the heterogeneity in clinical 
behavior) among patients* tumors may be traceable to underly- 
ing variation in DNA copy number. Sixth, this finding supports 
a possible role for widespread DNA copy number alteration in 
tumorigenesis (17, 18), beyond the amplification of specific 
oncogenes and deletion of specific tumor suppressor genes. 
Widespread DNA copy number alteration, and the concomitant 
widespread imbalance in gene expression, might disrupt critical 
stochiometric relationships in cell metabolism and physiology 
(e.g., proteosome, mitotic spindle), possibly promoting further 
chromosomal instability and directly contributing to tumor 
development or progression. Finally, our findings suggest the 
possibility of cancer therapies that exploit specific or global 
imbalances in gene expression in cancer. 
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Genome-wide Study of Gene Copy Numbers, 
Transcripts, and Protein Levels in Pairs of 
Non-invasive and Invasive Human Transitional 
Cell Carcinomas* 

Torben F. 0rntoftf§, Thomas ThykjaerTI, Frederic M. Waldman||, Hans Wolf**, 
and Julio E. Celis±± 



Gain and loss of chromosomal material is characteristic 
of bladder cancer, as well as malignant transformation in 
general. The consequences of these changes at both the 
transcription and translation levels is at present unknown 
partly because of technical limitations. Here we have at- 
tempted to address this question in pairs of non-invasive 
and invasive human bladder tumors using a combination 
of technology that included comparative genomic hybrid- 
ization, high density oligonucleotide array-based monitor- 
ing of transcript levels (5600 genes), and high resolution 



phenomenon at both the transcription and translation levels. 
High throughput array studies of the breast cancer cell line 
BT474 has suggested that there is a correlation between 
DNA copy numbers and gene expression in highly amplified 
areas (2), and studies of individual genes in solid tumors 
have revealed a good correlation between gene dose and 
mRNA or protein levels in the case of c-erb-B2, cyclin d1, 
ems1 t and N-myc (3-5). However, a high cyclin D1 protein 
expression has been observed without simultaneous am- 



two-dimensional gel electrophoresis/Yhe results showed^^™ ( 4 >. and a low ,evel of 0 ' m V c C °PV number in " 
that there is a gene dosage effect ShsA in some cases crease was observed without concomitant c-myc protein 



superimposes on other regulatory mechanisms. This ef- 
fect depended (p < 0.015) on the magnitude of the com- 
parative genomic hybridization change. In general {18 of 
23 cases), chromosomal areas with more than 2-fold gain 
of DNA showed a corresponding increase in mRNA tran- 
scripts. Areas with loss of DNA, on the other hand, 
showed either reduced or unaltered transcript levels^ Be- 
cause most proteins resolved by two-dimensional gels 
are unknown it was only possible to compare mRNA and 
protein alterations irjL relatively few cases of well focused 
abundant proteins. !with few exceptions we found a good 
correlation (p < 0.005) between transcript alterations and 
protein levels. The implications, as well as limitations, 
of the approach are discussed, Molecular & Cellular 
Proteomics 1:37-45, 2002. 



Aneuploidy is a common feature of most human cancers 
(1), but little is known about the genome-wide effect of this 
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overexpression (6). 

In human bladder tumors, karyotyping, fluorescent in situ 
hybridization, and comparative genomic hybridization (CGH) 1 
have revealed chromosomal aberrations that seem to be 
characteristic of certain stages of disease progression. In the 
case of non-invasive pTa transitional cell carcinomas (TCCs), 
this includes loss of chromosome 9 or parts of it, as well as 
loss of Y in males. In minimally invasive pT1 TCCs, the fol- 
lowing alterations have been reported: 2q-, 11 p-, 1q+, 
11q13+, 17q+, and 20q+ (7-12). It has been suggested that 
these regions harbor tumor suppressor genes arid onco- 
genes; however, the large chromosomal areas involved often 
contain many genes, making meaningful predictions of the 
functional consequences of losses and gains very difficult. 

In this investigation we have combined genome-wide tech- 
nology for detecting genomic gains and losses (CGH) with 
gene expression profiling techniques (microarrays and pro- 
teomics) to determine the effect of gene copy number on 
transcript and protein levels in pairs of non-invasive and in- 
vasive human bladder TCCs. 

EXPERIMENTAL PROCEDURES 

Material— Bladder tumor biopsies were sampled after informed 
consent was obtained and after removal of tissue for routine pathol- 
ogy examination. By light microscopy tumors 335 and 532 were 
staged by an experienced pathologist as pTa (superficial papillary), 

1 The abbreviations used are: CGH, comparative genomic hybrid- 
ization; TCC, transitional ceil carcinoma; LOH, loss of heterozygosity; 
PA-FABP, psoriasis-associated fatty acid-binding protein; 2D, 
two-dimensional. 
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Fig. 1 . DNA copy number and mRNA expression level. Shown from left to right are chromosome (Chn), CGH profiles, gene location and 
expression level of specific genes, and overall expression level along the chromosome. A, expression of mRNA in invasive tumor 733 as 
compared with the non-invasive counterpart tumor 335. 8, expression of mRNA in invasive tumor 827 compared with the non-invasive 
counterpart tumor 532. The average fluorescent signal ratio between tumor DNA and normal DNA is shown along the length of the chromosome 
(left). The bold curve in the ratio profile represents a mean of four chromosomes and is surrounded by thin curves indicating one standard 
deviation. The central vertical fine (broken) indicates a ratio value of 1 (no change), and the vertical lines next to it (dotted) indicate a ratio of 
0.5 (left) and 2.0 (rig/if). In chromosomes where the non-invasive tumor 335 used for comparison showed alterations in DNA content, the ratio 
profile of that chromosome is shown to the right of the invasive tumor profile. The colored bars represents one gene each, identified by the 
running numbers above the bars (the name of the gene can be seen at www.MDLDK/sdatahtml), The bars indicate the purported location of 
the gene, and the colors indicate the expression level of the gene in the invasive tumor compared with the non-invasive counterpart; > 2-fold 
increase (black), >2-fold decrease (blue), no significant change (orange). The bar to the far right, entitled Expression shows the resulting change 
in expression along the chromosome; the colors indicate that at least half of the genes were up-regulated (black), at least half of the genes 
down-regulated (blue), or more than half of the genes are unchanged (orange). If a gene was absent in one of the samples and present in 
another, it was regarded as more than a 2-fold change. A 2-fold level was chosen as this corresponded to one standard deviation in a double 
determination of —1800 genes. Centromeres and heterochromatic regions were excluded from data analysis. 



grade I and II, respectively, tumors 733 and 827 were staged as pT1 
(invasive into submucosa), 733 was staged as solid, and 827 was 
staged as papillary, both grade III. 

mRNA Preparation —Tissue biopsies, obtained fresh from surgery, 
were embedded immediately in a sodium-guanidinlum thiocyanate 
solution and stored at -80 Q C. Total RNA was isolated using the 
RNAzol B RNA isolation method (WAK-Chemie Medical GMBH). 
poly(A) + RNA was isolated by an oligo(dT) selection step (Otigotex 
mRNA kit; Qiagen). 

cRNA Preparation— 1 ^.g of mRNA was used as starting material. 
The first and second strand cDNA synthesis was performed using the 
Superscript® choice system (Invitrogen) according to the manufac- 
turer's instructions but using an oligo(dT) primer containing a T7 RNA 
polymerase binding site. Labeled cRNA was prepared using the ME- 
GAscrip® in vitro transcription kit (Ambion). Biotin-labeled CTP and 



UTP (Enzo) was used, together with unlabeled NTPs in the reaction. 
Following the in vitro transcription reaction, the unincorporated nu- 
cleotides were removed using RNeasy columns (Qiagen). 

Array Hybridization and Scanning— Array hybridization and scan- 
ning was modified from a previous method (13), 10 ^g of cRNA was 
fragmented at 94 °C for 35 min in buffer containing 40 rrtM Tris 
acetate, pH 8.1, 100 mM KOAc, 30 mM MgOAc. Prior to hybridization, 
the fragmented cRNA in a 6x SSPE-T hybridization buffer (1 m NaCl, 
10 mM Tris, pH 7.6, 0.005% Triton), was heated to 95 °C for 5 min, 
subsequently cooled to 40 a C, and loaded onto the Affymetrix probe 
array cartridge. The probe array was then incubated for 16 h at 40 °C 
at constant rotation (60 rpm). The probe array was exposed to 10 
washes in 6x SSPE-T at 25 °C followed by 4 washes in 0.5X SSPE-T 
at 50 °C. The biotinylated cRNA was stained with a streptavidin- 
phycoerythrin conjugate, 1 0 fig/ml (Molecular Probes) in 6x SSPE-T 
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Fig. 1 — continued 



for 30 min at 25 °C followed by 1 0 washes in 6 x SSPE-T at 25 °C. The 
probe arrays were scanned at 560 nm using a confocal laser scanning 
microscope (made for Affymetrix by Hewlett-Packard). The readings 
from the quantitative scanning were analyzed by Affymetrix gene 
expression analysis software. 

Microsateltite Analysis— Microsatellite Analysis was performed as 
described previously (14). Microsatellites were selected by use of 
www.ncbi.nlJ7i.nih.gov/genemap98, and primer sequences were ob- 
tained from the genome data base at www.gdb.org. DNA was extracted 
from tumor and blood and amplified by PCR in a volume of 20 pd for 35 
cycles. The amplicons were denatured and electrophoresed for 3 h in an 
ABI Prism 377. Data were collected in the Gene Scan program for 
fragment analysis. Loss of heterozygosity was defined as less than 33% 
of one allele detected in tumor amplicons compared with blood. 

Proteomic Analysis— TCCs were minced into small pieces and 
homogenized in a small glass homogenizer in 0.5 ml of lysis solution. 
Samples were stored at -20 °C until use. The procedure for 2D gel 
electrophoresis has been described in detail elsewhere (15, 16). Gels 
were stained with silver nitrate and/or Coomassie Brilliant Blue. Pro- 
teins were Identified by a combination of procedures that included 
microsequencing, mass spectrometry, two-dimensional gel Western 
immunoblotting, and comparison with the master two-dimensional gel 
image of human keratinocyte proteins; see biobase.dk/cgi-bin/celis. 

CGH— Hybridization of differentially labeled tumor and normal DNA 
to normal metaphase chromosomes was performed as described 
previously (10). Fluorescein-labeled tumor DNA (200 ng), Texas Red- 



labeled reference DNA (200 ng), and human Cot-1 DNA (20 jxg) were 
denatured at 37 °C for 5 min and applied to denatured normal met- 
aphase slides. Hybridization was at 37 °C for 2 days. After washing, 
the slides were counterstained with 0.15 /xg/ml 4,6-diamidino-2-phe- 
nylindole in an anti-fade solution. A second hybridization was per- 
formed for all tumor samples using fluorescein-labeled reference DNA 
and Texas Red -labeled tumor DNA (inverse labeling) to confirm the 
aberrations detected during the initial hybridization. Each CGH ex- 
periment also included a normal control hybridization using fluores- 
cein- and Texas Red-labeled normal DNA. Digital image analysis was 
used to identify chromosomal regions with abnormal fluorescence 
ratios, indicating regions of DNA gains and tosses. The average 
green:red fluorescence intensity ratio profiles were calculated using 
four images of each chromosome (eight chromosomes total) with 
normalization of the green:red fluorescence intensity ratio for the 
entire metaphase and background correction. Chromosome identifi- 
cation was performed based on 4,6-dtamidino-2-phenylindole band- 
ing patterns. Only images showing uniform high intensity fluores- 
cence with minimal background staining were analyzed. All 
centromeres, p arms of acrocentric chromosomes, and heterochro- 
matic regions were excluded from the analysis. 

RESULTS 

Comparative Genomic Hybridization—The CGH analysis 
identified a number of chromosomal gains and losses in the 
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Table I 

Correlation between alterations detected by CGH and by expression monitoring 

Top, CGH used as independent variable (if CGH alteration - what expression ratio was found); bottom, altered expression used as 
independent variable (if expression alteration - what GGH deviation was found). 



CGH alterations 



Tumor 733 vs. 335 
Expression change clusters 



Concordance 



CGH alterations 



Tumor 827 vs. 532 
Expression change clusters 



Concordance 



13 Gain 



10 Loss 



10 Up-reguiation 

0 Down-regulation 

3 No change 

1 Up-regulation 

5 Down-regulation 

4 No change 



77% 



50% 



10 Gain 



12 Loss 



8 Up-regulation 
0 Down-regulation 

2 No change 

3 Up-regulation 

2 Down regulation 
7 No change 



80% 



17% 



Expression change clusters 


Tumor 733 vs. 335 


Concordance 


Expression change clusters 


Tumor 827 vs, 532 


Concordance 


CGH alterations 


CGH alterations 


16 Up-regulation 


11 Gain 


69% 


17 Up-regulation 


10 Gain 


59% 




2 Loss 






5 Loss 






3 No change 






2 No change 




21 Down-regulation 


1 Gain 


38% 


9 Down-regulation 


0 Gain 


33% 




8 Loss 






3 Loss 






12 No change 






6 No change 




15 No change 


3 Gain 


60% 


21 No change 


1 Gain 


81% 




3 Loss 






3 Loss 






9 No change 






17 No change 





two invasive tumors (stage pT1 , TCCs 733 and 827), whereas 
the two non-invasive papillomas (stage pTa, TCCs 335 and 
532) showed only 9p-, 9q22-q33-, and X-, and 7+ f 9q-, 
and Y-, respectively. Both invasive tumors showed changes 
(1q22-24+, 2q14.1-qter-, 3q12-q13.3- ( 6q12-q22-, 
9q34+, 11q12-q13+, 17+, and 20q1 1.2-q12+) that are typ- 
ical for their disease stage, as well as additional alterations, 
some of which are shown in Fig. 1. Areas with gains and 
losses deviated from the normal copy number to some extent, 
and the average numerical deviation from normal was 0.4-fold 
in the case of TCC 733 and 0.3-fold for TCC 827. The largest 
changes, amounting to at least a doubling of chromosomal 
content, were observed at 1q23 in TCC 733 (Fig. 1A) and 
20q12 in TCC 827 (Rg. 1S). 

mRNA Expression in Relation to DNA Copy Number— The 
mRNA levels from the two invasive tumors (TCCs 827 and 
733) were compared with the two non-invasive counterparts 
(TCCs 532 and 335). This was done in two separate experi- 
ments in which we compared TCCs 733 to 335 and 827 to 
532, respectively, using two different scaling settings for the 
arrays to rule out scaling as a confounding parameter. Ap- 
proximately 1 ,800 genes that yielded a signal on the arrays 
were searched in the Unigene and Genemap data bases for 
chromosomal location, and those with a known location 
(1 096) were plotted as bars covering their purported locus, in 
that way it was possible to construct a graphic presentation of 
DNA copy number and relative mRNA levels along the indi- 
vidual chromosomes (Fig. 1). 

For each mRNA a ratio was calculated between the level in 
the invasive versus the non-invasive counterpart. Bars, which 
represent chromosomal location of a gene, were color-coded 
according to the expression ratio, and only differences larger 



than 2-fold were regarded as informative (Rg. 1). The density 
of genes along the chromosomes varied, and areas contain- 
ing only one gene were excluded from the calculations. The 
resolution of the QGH method is very low, and some of the 
outlier data may be because of the fact that the boundaries of 
the chromosomal aberrations are not known at high resolution. 

Two sets of calculations were made from the data. For the 
first set we used CGH alterations as the independent variable 
and estimated the frequency of expression alterations in these 
chromosomal areas. In general, areas with a strong gain of 
chromosomal material contained a cluster of genes having 
increased mRNA expression. For example, both chromo- 
somes 1q21-q25, 2p and 9q, showed a relative gain of more 
than 100% in DNA copy number that was accompanied by 
increased mRNA expression levels in the two tumor pairs (Fig. 
1). In most cases, chromosomal gains detected by CGH were 
accompanied by an increased level of transcripts in both 
TCCs 733 (77%) and 827 (80%) (Table I, top). Chromosomal 
losses, on the other hand, were not accompanied by de- 
creased expression in several cases, and were often regis- 
tered as having unaltered RNA levels (Table i, fop). The inabil- 
ity to detect RNA expression changes in these cases was not 
because of fewer genes mapping to the lost regions (data not 
shown). 

In the second set of calculations we selected expression 
alterations above 2-foid as the independent variable and es- 
timated the frequency of CGH alterations in these areas. As 
above, we found that increased transcript expression corre- 
lated with gain of chromosomal material (TCC 733, 69% and 
TCC 827, 59%), whereas reduced expression was often de- 
tected in areas with unaltered CGH ratios (Table I, bottom). 
Furthermore, as a control we looked at areas with no alter- 
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Fig. 2. Correlation between maximum CGH aberration and the ability to detect expression change by oligonucleotide array 
monitoring. The aberration is shown as a numerical -fold change in ratio between invasive tumors 827 (A) and 733 (♦) and their non-invasive 
counterparts 532 and 335. The expression change was taken from the Expression line to the right in Fig. 1 , which depicts the resulting 
expression change for a given chromosomal region. At (east half of the mRNAs from a given region have to be either up- or down-regulated 
to be scored as an expression change. Ail chromosomal arms in which the CGH ratio plus or minus one standard deviation was outside the 
ratio value of one were included. 



ation in expression. No alteration was detected by CGH in 
most of these areas (TCC 733, 60% and TCC 827, 81 %; see 
Table I, bottom). Because the ability to observe reduced or 
increased mRNA expression clustering to a certain chromo- 
somal area clearly reflected the extent of copy number 
changes, we plotted the maximum CGH aberrations in the 
regions showing CGH changes against the ability to detect a 
change in mRNA expression as monitored by the oligonucleo- 
tide arrays {Fig. 2)(£or both tumors TCC 733 (p < 0.015) and 
TCC 827 {p < 0.00003) a highly significant correlation was 
observed between the level of CGH ratio change (reflecting 
the DNA copy number) and alterations detected by the array 
based technology (Fig. 2^ Similar data were obtained when 
areas with altered expression were used as independent vari- 
ables. These areas correlated best with CGH when the CGH 
ratio deviated 1 .6- to 2.0-fold (Table I, bottom) but mostly did 
not at lower CGH deviations. These data probably reflect that 
loss of an allele may only lead to a 50% reduction in expres- 
sion level, which is at the cut-off point for detection of expres- 
sion alterations. Gain of chromosomal material can occur to a 
much larger extent. 

Microsatellite-based Detection of Minor Areas of Loss- 
es—In TCC 733, several chromosomal areas exhibiting DNA 
amplification were preceded or followed by areas with a nor- 
mal CGH but reduced mRNA expression (see Fig. 1 , TCC 733 
chromosome 1q32 t 2p21 f and 7q21 and q32, 9q34, and 
1 0q22). To determine whether these results were because of 
undetected loss of chromosomal material in these regions or 



because of other non-structural mechanisms regulating tran- 
scription, we examined two microsatellites positioned at chro- 
mosome 1q25-32 and two at chromosome 2p22. Loss of 
heterozygosity (LOH) was found at both 1q25 and at 2p22 
indicating that minor deleted areas were not detected with the 
resolution of CGH (Fig. 3). Additionally, chromosome 2p in 
TCC 733 showed a CGH pattern of gain/no change/gain of 
DNA that correlated with transcript increase/decrease/in- 
crease. Thus, for the areas showing increased expression 
there was a correlation with the DNA copy number alterations 
(Fig. 1^). As indicated above, the mRNA decrease observed in 
the middle of the chromosomal gain was because of LOH, 
implying -that one of the mechanisms for mRNA down-regu- 
lation may be regions that have undergone smaller losses of 
chromosomal material. However, this cannot be detected with 
the resolution of the CGH method. 

In both TCC 733 and TCC 827 , the telomeric end of chro- 
mosome 11p showed a normal ratio in the CGH analysis; 
however, clusters of five and three genes, respectively, lost 
their expression. Two microsatellites (D11S1760, D11S922) 
positioned close to MUC2, IGF2, and cathepsin D indicated 
LOH as the most likely mechanism behind the loss of expres- 
sion (data not shown). 

A reduced expression of mRNA observed in TCC 733 at 
chromosomes 3q24, 11p11 t 12p12.2, 12q21.1, and 16q24 
and in TCC 827 at chromosome 11p15.5 ? 12p11, 15q11.2, 
and 1 8q1 2 was also examined for chromosomal losses using 
microsatellites positioned as close as possible to the gene loci 
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Fig. 3. Microsatellite analysis of loss of heterozygosity. Tumor 
733 showing loss of heterozygosity at chromosome 1q25, detected 
(a) by D1S215 close to Hu class I histocompatibility antigen (gene 
number 38 in Rg. 1), (b) by D1S2735 close to cathepsin E (gene 
number 41 in Fig. 1), and (c) at chromosome 2p23 by D2S2251 close 
to general ^-spectrin (gene number 1 1 on Fig. 1) and of (d) tumor 827 
showing loss of heterozygosity at chromosome 18q12 by S18S11 18 
close to mitochondrial 3-oxoacyl-coenzyme A thiolase (gene number 
12 in Fig. 1). The upper curves show the electropherogram obtained 
from normal DNA from leukocytes (A/), and the lower curves show the 
electropherogram from tumor DNA (J). In all cases one allele is 
partially lost in the tumor ampHcon. 

showing reduced mRNA transcripts. Only the microsatellite 
positioned at 18q12 showed LOH (Fig. 3), suggesting that 
transcriptional down-regulation of genes in the other regions 
may be controlled by other mechanisms. 

Relation between Changes in mRNA and Protein Levels— 
2D-PAGE analysis, in combination with Coomassie Brilliant 
Blue and/or silver staining, was carried out on all four tumors 
using fresh biopsy material. 40 well resolved abundant known 
proteins migrating in areas away from the edges of the pH 
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Fig. 4. Correlation between protein levels as judged by 20- 
PAGE and transcript ratio. For comparison proteins were divided in 
three groups, unaltered in level or up- or down-regulated {horizontal 
axis). The mRNA ratio as determined by oligonucleotide arrays was 
plotted for each gene (vertical axis). mRNAs that were scored as 
present in both tumors used for the ratio calculation; A, mRNAs that 
were scored as absent in the invasive tumors (along horizontal axis) or 
as absent in non-invasive reference {top of figure). Two different 
scalings were used to exclude scaling as a confounder, TCCs 827 
and 532 (AA) were scaled with background suppression, and TCCs 
733 and 335 (#0) were scaled without suppression. Both compari- 
sons showed highly significant (p < 0.005) differences in mRNA ratios 
between the groups. Proteins shown, were as follows: Group A (from 
left), phosphogiucomutase 1, glutathione transferase class jx number 
4, fatty acid-binding protein homologue, cytokeratin 15, and cyto- 
keratin 13; B (from left), fatty acid-binding protein homologue, 28-kDa 
heat shock protein, cytokeratin 13, and calcyclin; Ctfrom left), a-eno- 
lase, hnRNP B1, 28-kDa heat shock protein, 14-3-3-e, and 
pre-mRNA splicing factor; D, mesothelial keratin K7 (type II); E (from 
top), glutathione S-transferase-ir and mesothelial keratin K7 (type il); 
F(from top and left), adenyryl cyclase-associated protein, E-cadherin, 
keratin 19, calgizzarin, phosphoglycerate mutase, annexin IV, cy- 
toskeletal y-actin, hnRNP A1, integral membrane protein calnexin 
(IP90). hnRNP H, brain-type clathrin light chain-a, hnRNP F, 70-kDa 
heat shock protein, heterogeneous nuclear ribonucieoprotein A/B, 
translationaily controlled tumor protein, liver giyceraldehyde-3-phos- 
phate dehydrogenase, keratin 8, aldehyde reductase, and Na.K- 
ATPase 0-1 subunit; G, (from top and /eft), TCP20, calgizzarin, 70- 
kDa heat shock protein, calnexin, hnRNP H, cytokeratin 15, ATP 
synthase, keratin 1 9, triosephosphate isomerase, hnRNP F, liver glyc- 
eraldehyde-3-phosphatase dehydrogenase, glutathione S-transfer- 
ase-ir, and keratin 8; H (from /eft), plasma gelsolin, autoantigen cal- 
reticulin, thioredoxin, and NAD+-dependent 15 hydroxyprostaglandin 
dehydrogenase; / (from fop), prolyl 4-hydroxylase /3-subunit, cyto- 
keratin 20, cytokeratin 17, prohibition, and fructose 1,6-biphos- 
phatase; J annexin II; K, annexin IV; L (from top and left), 90-kDa heat 
shock protein, prolyl 4-hydroxytase 0-subunit, a-enolase, GRP 78, 
cyclophilin, and cofilin. 

gradient, and having a known chromosomal location, were 
selected for analysis In the TCC pair 827/532. Proteins were 
identified by a combination of methods (see "Experimental 
Procedures"). In general there was a highly significant corre- 
lation (p < 0.005) between mRNA and protein alterations (Fig. 
4). Only one gene showed disagreement between transcript 
alteration and protein alteration. Except for a group of cyto- 
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Fig. 5. Comparison of protein and transcript levels in invasive 
and non-invasive TCCs. The upper part of the figure shows a 2D gel 
(left) and the oligonucleotide array (righfy of TCC 532. The red rectan- 
gles on the upper gel highlight the areas that are compared below. 
Identical areas of 2D gels of TCCs 532 and 827 are shown below. 
Clearly, cytokeratins 13 and 15 are strongly down-regulated in TCC 
827 (red annotation). The tile on the array containing probes for 
cytokeratin 1 5 is enlarged below the array (red arrow) from TCC 532 
and is compared with TCC 827. The upper row of squares in each tiie 
corresponds to perfect match probes; the tower row corresponds to 
mismatch probes containing a mutation (used for correction for un- 
specific binding). Absence of signal is depicted as black, and the 
higher the signal the lighter the color. A high transcript level was 
detected in TCC 532 (6151 units) whereas a much lower level was 
detected in TCC 827 (absence of signals). For cytokeratin 13. a high 
transcript level was also present in TCC 532 (15659 units), and a 
much lower level was present in TCC 827 (623 units). The 2D gels at 
the bottom of the figure [left] show levels of PA-FABP and adipocyte- 
FABP in TCCs 335 and 733 (invasive), respectively. Both proteins are 
down-regulated in the invasive tumor. To the right we show the array 
tiles for the PA-FABP transcript. A medium transcript level was de- 
tected in the case of TCC 335 (1277 units) whereas very low levels 
were detected in TCC 733 (166 units). IEF, isoelectric focusing. 



keratins encoded by genes on chromosome 17 (Fig. 5) the 
analyzed proteins did not belong to a particular family. 26 well 
focused proteins whose genes had a know chromosomal 
location were detected in TCCs 733 and 335, and of these 19 
correlated (p < 0.005) with the mRNA changes detected using 
the arrays (Fig. 4). For example, PA-FABP was highly ex- 
pressed in the non-invasive TCC 335 but lost in the invasive 
counterpart (TCC 733; see Fig. 5). The smaller number of 
proteins detected in both 733 and 335 was because of the 
smaller size of the biopsies that were available. 

1 1 chromosomal regions where CGH showed aberrations 
that corresponded to the changes in transcript levels also 
showed corresponding changes in the protein ievei (Table li). 
These regions included genes that encode proteins that are 
found to be frequently altered in bladder cancer, namely 
cytokeratins 17 and 20, annexins II and IV, and the fatty 
acid-binding proteins PA-FABP and FBP1 , Four of these pro- 
teins were encoded by genes in chromosome 17q, a fre- 
quently amplified chromosomal area in invasive bladder 
cancers. 

DISCUSSION 

Most human cancers have abnormal DNA content, having 
lost some chromosomal parts and gained others. The present 
study provides some evidence as to the effect of these gains 
and losses on gene expression in two pairs of non-invasive 
and invasive TCCs using high throughput expression arrays 
and proteomics, in combination with CGH. In general, the 
results showed that there is a clear individual regulation of the 
mRNA expression of single genes, which in some cases was 
superimposed by a DNA copy number effect. In most cases, 
genes located in chromosomal areas with gains often exhib- 
ited increased mRNA expression, whereas areas showing 
losses showed either no change or a reduced mRNA expres- 
sion. The latter might be because of the fact that losses most 
often are restricted to loss of one allele, and the cut-off point 
for detection of expression alterations was a 2-fold change, 
thus being at the border of detection. In several cases, how- 



Table II 



Proteins whose expression level correlates with both mRNA and gene dose changes 


Protein 


Chromosomal location 


Tumor TCC 


CGH alteration 


Transcript alteration* 


Protein alteration 


Annexin H 


1q21 


733 


Gain 


Abs to Pres B 


Increase 


Annexin IV 


2p13 


733 


Gain 


3.9-Fold up 


Increase 


Cytokeratin 17 


17q12-q21 


827 


Gain 


3.8-Foid up 


Increase 


Cytokeratin 20 


17q21.1 


827 


Gain 


'5.6-Fold up 


Increase 


(PA-)FABP 


8q21.2 


827 


Loss 


1 0-Fold down 


Decrease 


. FBP1 


9q22 


827 


Gain 


2.3-Fold up 


Increase 


Plasma gelsolin 


9q31 


827 


Gain 


Abs to Pres 


Increase 


Heat shock protein 28 


15q12-q13 


827 


toss 


2.5-Fold up 


Decrease 


Prohibitin 


17q21 


827/733 


Gain 


3J-/2.5-Fold up 6 


Increase 


Prolyl-4-hydroxyl 


17q25 


827/733 


Gain 


5.7-/1 .6-Fold up 


Increase 


hnRNPBI 


7p15 


827 


Loss 


2.5-Fold down 


Decrease 



a Abs, absent; Pres. present. 

D In cases where the corresponding alterations were found in both TCCs 827 and 733 these are shown as 827/733. 
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ever, an increase or decrease in DNA copy number was 
associated with de novo occurrence or complete loss of tran- 
script, respectively. Some of these transcripts could not be 
detected in the non-invasive tumor but were present at rela- 
tively high levels in areas with DNA amplifications in the inva- 
sive tumors {e.g. in TCC 733 transcript from cellular ligand of 
annexin II gene (chromosome 1q21) from absent to 2670 
arbitrary units; in TCC 827 transcript from small proline-rich 
protein 1 gene (chromosome 1q12-q21.1) from absent to 
1326 arbitrary units). It may be anticipated from these data 
that significant clustering of genes with an increased expres- 
sion to a certain chromosomal area indicates an increased 
likelihood of gain of chromosomal material in this area. 

Considering the many possible regulatory mechanisms act- 
ing at the level of transcription, it seems striking that the gene 
dose effects were so clearly detectable in gained areas. One 
hypothetical explanation may lie in the loss of controlled 
methylation in tumor cells (17-19). Thus, it may be possible 
that in chromosomes with increased DNA copy numbers two 
or more alleles could be demethylated simultaneously leading 
to a higher transcription level, whereas in chromosomes with 
losses the remaining allele could be partly methylated, turning 
off the process (20, 21). A recent report has documented a 
ploidy regulation of gene expression in yeast, but in this case all 
the genes were present in the same ratio (22), a situation that is 
not analogous to that of cancer ceils, which show marked 
chromosomal aberrations, as well as gene dosage effects. 

Several CGH studies of bladder cancer have shown that 
some chromosomal aberrations are common at certain 
stages of disease progression, often occurring in more than 1 
of 3 tumors. In pTa tumors, these include 9p-, 9q-, 1q+, Y- 
(2, 6), and in pT1 tumors, 2q-,11p- l 11q-, 1q+, 5p-f, 8q+, 
17q+, and 20q+ (2-4, 6, 7). The pTa tumors studied here 
showed similar aberrations such as 9p- and 9q22-q33- and 
9q- and Y-, respectively. Likewise, the two minimal invasive 
pT1 tumors showed aberrations that are commonly seen at 
that stage, and TCC 827 had a remarkable resemblance to the 
commonly seen pattern of losses and gains, such as 1 q22-24 
amplification (seen in both tumors), 11q14-q22 loss, the latter 
often linked to 17 q + (both tumors), and 1q+ and 9p- t often 
linked to 20q + and 11 q13+ (both tumors) (7-9). These ob- 
servations indicate that the pairs of tumors used in this study 
exhibit chromosomal changes observed in many tumors, and 
therefore the findings could be of general importance for 
bladder cancer. 

Considering that the mapping resolution of CGH is of about 
20 megabases it is only possible to get a crude picture of 
chromosomal instability using this technique. Occasionally, 
we observed reduced transcript levels close to or inside re- 
gions with increased copy numbers. Analysis of these regions 
by positioning heterozygous microsatellites as close as pos- 
sible to the locus showing reduced gene expression revealed 
loss of heterozygosity in several cases. It seems likely that 
multiple and different events occur along each chromosomal 



arm and that the use of cDNA microarrays for analysis of DNA 
copy number changes will reach a resolution that can resolve 
these changes, as has recently been proposed (2). The outlier 
data were not more frequent at the boundaries of the CGH 
aberrations. At present we do not know the mechanism be- 
hind chromosomal aneuploidy and cannot predict whether 
chromosomal gains wilt be transcribed to a larger extent than 
the two native alleles. A mechanism as genetic imprinting has 
an impact on the expression level in normal cells and is often 
reduced in tumors. However, the relation between imprinting 
and gain of chromosomal material is not knowa 

We regard it as a strength of this Investigation that we were 
able to compare invasive tumors to benign tumors rather than 
to normal urothelium, as the tumors studied were biologically 
very close and probably may represent successive steps in 
the progression of bladder cancer. Despite the limited amount 
of fresh tissue available it was possible to apply three different 
state of the art methods. The observed correlation between 
DNA copy number and mRNA expression is remarkable when 
one considers that different pieces of the tumor biopsies were 
used for the different sets of experiments. This indicate that 
bladder tumors are relatively homogenous, a notion recently 
supported by CGH and LOH data that showed a remarkable 
similarity even between tumors and distant metastasis (10, 23). 

In the few cases analyzed, mRNA and protein levels 
showed a striking correspondence although in some cases 
we found discrepancies that may be attributed to translational 
regulation, post-translationai processing, protein degrada- 
tion, or a combination of these. Some transcripts belong to 
undertranslated mRNA pools, which are associated with few 
translationally inactive ribosomes; these pools, however, 
seem to be rare (24). Protein degradation, for example, may 
be very important in the case of polypeptides with a short 
halMife (e.g. signaling proteins). A poor correlation between 
mRNA and protein levels was found in liver cells as deter- 
mined by arrays and 2D-PAGE (25), and a moderate correla- 
tion was recently reported by Ideker et a/. (26) in yeast, 
^Interestingly, our study revealed a much better correlation 
between gained chromosomal areas and increased mRNA 
levels than between loss of chromosomal areas and reduced 
mRNA levels. In general, the level of CGH change determined 
the ability to detect a change in transcript^ One possible 
explanation could be that by losing one allele the change in 
mRNA level is not so dramatic as compared with gain of 
material, which can be rather unlimited and may lead to a 
severalfold increase in gene copy number resulting in a much 
higher impact on transcript level. The latter would be much 
easier to detect on the expression arrays as the cut-off point 
was placed at a 2-fold level so as not to be biased by noise on 
the array. Construction of arrays with a better signal to noise 
ratio may in the future allow detection of lesser than 2-fold 
alterations in transcript levels, a feature that may facilitate the 
analysis of the effect of loss of chromosomal areas on tran- 
script levels. 
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In eleven cases we found a significant correlation between 
DNA copy number, rnRNA expression, and protein level. Four 
of these proteins were encoded by genes located at a fre- 
quently amplified area in chromosome 17q. Whether DNA 
copy number is one of the mechanisms behind alteration of 
these eleven proteins is at present unknown and will have to 
be proved by other methods using a larger number of sam- 
ples. One factor making such studies complicated is the large 
extent of protein modification that occurs after translation, 
requiring tmmunoidentification and/or mass spectrometry to 
correctly identify the proteins in the gels. 

In conclusion, the results presented in this study exemplify 
the large body of knowledge that may be possible to gather in 
the future by combining state of the art techniques that follow 
the pathway from DNA to protein (26). Here, we used a tradi- 
tional chromosomal CGH method, but in the future high reso- 
lution CGH based on microarrays with many thousand radiation 
hybrid -mapped genes will increase the resolution and informa- 
tion derived from these types of experiments (2). Combined with 
expression arrays analyzing transcripts derived from genes with 
known locations, and 2D gel analysis to obtain information at 
the post-translational level, a clearer and more developed un- 
derstanding of the tumor genome will be forthcoming. 
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HER-2/neti Breast Cancer Predictive Testing 
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Each year, over 1 82,000 women in the United States are 
diagnosed with breast cancer, and approximately 45,000 die 
of the disease. 5 Incidence appears to be increasing in the 
United States at a rate of roughly 2% per year. The reasons 
for the increase are unclear, but non-genetic risk factors appear 
to play a large role. 2 

Five-year survival rates range from approximately 65%- 
85%, depending on demographic group, with a significant 
percentage of women experiencing recurrence of their cancer 
within 10 years of diagnosis. One of the factors most predic- 
tive for recurrence once a diagnosis of breast cancer has been 
made is the number of axillary lymph nodes to which tumor 
has metastasized. Most node-positive women are given adju- 
vant therapy, which increases their survival. However, 20%- 
30% of patients without axillary node involvement also 
develop recurrent disease, and the difficulty lies in how to iden- 
tify this high-risk subset of patients. These patients could 
benefit from increased surveillance, early intervention, and 
treatment. 

Prognostic markers currently used in breast cancer recur- 
rence prediction include tumor size, histological grade, steroid 
hormone receptor status, DNA ploidy, proliferative index, and 
cathepsin D status. Expression of growth factor receptors and 
over-expression of the HER-2/neu oncogene have also been 
identified as having value regarding treatment regimen and 
prognosis. 

HER-2/neu (also known as c-erbB2) is an oncogene that 
encodes a transmembrane glycoprotein that is homologous 
to, but distinct from, the epidermal growth factor receptor. 
Numerous studies have indicated that high levels of expres- 
sion of this protein are associated with rapid tumor growth, 
certain forms of therapy resistance, and shorter disease-free 
survival. The gene has been shown to be amplified and/or 
overexpressed in \0%-30% of invasive breast cancers and in 
40%-60% of intraductal breast carcinoma. 3 

There are two distinct FDA-approved methods by which 
HER-2/neu status can be evaluated: immunohistochemistry 
(IHC, HercepTest™) and FISH (fluorescent in situ hybridiza- 
tion, PathVysion™ Kit). Both methods can be performed on 
archived and current specimens. The first method allows visual 
assessment of the amount of HER-2/neu protein present on 
the cell membrane. The latter method allows direct quantifi- 
cation of the level of gene amplification present in the tumor, 
enabling differentiation between low- versus high-amplifica- 
tidh. At least one study has demonstrated a difference in 



recurrence risk in women younger than 40 years of age for 
low- versus high-amplified tumors (54.5% compared to 
85.7%); this is compared to a recurrence rate of 16.7% for 
patients with no HER-2/neu gene amplification. 4 HER-2/neu 
status may be particularly important to establish in women with 
small (< 1 cm) tumor size. 

The choice of methodology for determination of HER-2/ 
neu status depends in part on the clinical setting. FDA approval 
for the Vysis FISH test was granted based on clinical trials 
involving 1549 node-positive patients. Patients received one 
of three different treatments consisting of different doses of 
cyclophosphamide, Adriamycin, and 5-fluorouracil (CAF). 
The study showed that patients with amplified HER-2/neu 
benefited from treatment with higher doses of adriamycin- 
based therapy, while those with normal HER-2/neu levels did 
not. The study therefore identified a sub- set of women, who 
because they did not benefit from more aggressive treatment, 
did not need to be exposed to the associated side effects. In 
addition, other evidence indicates that HER-2/neu amplifica- 
tion in node-negative patients can be used as an independent 
prognostic indicator for early recurrence, recurrent disease at 
any time and disease-related death. 5 Demonstration of HER- 
2/neu gene amplification by FISH has also been shown to be 
of value in predicting response to chemotherapy in stage-2 
breast cancer patients. 

Selection of patients for Herceptin® (Trastuzumab) mono- 
clonal antibody therapy, however, is based upon demonstra- 
tion of HER-2/neu protein overexpression using HercepTest™. 
Studies using Herceptin® in patients with metastatic breast 
cancer show an increase in time to disease progression, 
increased response rate to chemotherapeutic agents and a small 
increase in overall survival rate. The FISH assays have not yet 
been approved for this purpose, and studies looking at response 
to Herceptin® in patients with or without gene amplification 
status determined by FISH are in progress. 

In general, FISH and IHC results correlate well. However, 
subsets of tumors are found which show discordant results; 
i.e., protein overexpression without gene amplification or lack 
of protein overexpression with gene amplification- The clini- 
cal significance of such results is unclear. Based on the above 
considerations, HER-2/neu testing at SHMC/PAML will uti- 
lize immunohistochemistry (HercepTest 0 ) as a screen, fol- 
lowed by FISH in IHC-negative cases. Alternatively, either 
method may be ordered individually depending on the clini- 
cal setting or clinician preference. 
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CPT code information 

HER-2/neuviaIHC 

88342 (including interpretive report) 

HER-2/neu via FISH 

8827 1 x2 Molecular cytogenetics, LWA probe, each 
88274 Molecular cytogenetics, interphase in situ hybrid- 
ization, analyze 25-99 cells 
882? 1 Cytogenetics and molecular cytogenetics, interpre- 
tation and report 



Procedural Information 

Immunohistochemistry is performed using the FDA-approved 
DAKO antibody kit, Herceptest*>. The DAKO kit contains 
reagents required to complete a two-step immunohisto- 
chemical staining procedure for.routinely processed, paraffin- 
embedded specimens. Following incubation with the primary 
rabbit antibody to human HER-2yneu protein, the kit employs 
a ready-to-use dextran-based visualization reagent. This re- 
agent consists of both secondary goat anti-rabbit antibody 
molecules with horseradish peroxidase molecules linked to a 
common dextran polymer backbone, thus eliminating the need 
for sequential application of link antibody and peroxidase 
conjugated antibody. Enzymatic conversion of the subse- 
quently added chromogen results in formation of visible 
reaction product at the antigen site, the specimen is then coun- 
terstained; a pathologist using light-microscopy interprets 

results. , 

PISH analysis at SHMC/PAML is performed using the 
FD A-approved Path Vysion™ HER-2/neu DNA probe kit, pro- 
duced by Vysis, Inc. Formalin fixed, paraffin-embedded breast 
tissue is processed using routine histological methods, and then 
slides are treated to allow hybridization of DNA probes to the 
nuclei present in the tissue section. The Pathvysion™ kit con- 
tains two direct-labeled DNA probes, one specific for the 
alphoid repetitive DNA (CEP 17, spectrum orange) present at 
the chromosome 17 centromere and the second foT the HER- 
2/neu oncogene located at 17ql 1.2-12 (spectrum green). Enu- 
meration of the probes allows a ratio of the number of copies 
of chromosome 17 to the number of copies of HER-2/neu to 
be obtained; this enables quantification of low versus high 
amplification levels, and allows an estimate of the percentage 
of cells with HER-2/neu gene amplification. The clinically 
relevant distinction is whether the gene amplification is due. 
to increased gene copy number on the two chromosome 17 
homologues normally present or an increase in the number of 
chromosome 17s in the cells. In the majority of cases, ratio 
equivalents less than 2.0 are indicative of a normal/negatrve 
result ratios of 2.1 and over indicate that amplification is 
present and to what degree. Interpretation of this data will be 
performed and reported from the Vysis-certified Cytogenet- 
ics laboratory at SHMC. 
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SECOND DECLARATION OF PAUL POLARIS. Ph.D. 



1 T Paul Polakis, Ph.D., declare and say as follows; 

1 . I am currently employed by Genentech T Inc. where my job tide is Staff 
Scientist. 

2. Since joining Genenteeh in 1999, one of my primary responsibilities has 
been leading Genemecb's Tumor Antigen Project, which is a large research 
project with a primary focus on identifying tumor cell markers thar find use 
as targets for both the diagnosis and treatment of cancer in humans. 

3. As I stated in my previous Declaration dated May 7, 2004 (attached as 
Exhibit A), ray laboratory has been employing a variety of techniques, 
including microarray analysis, to identify genes which are differentially 
expressed in human tumor tissue relative to normal human tissue. The 
primary purpose of this research is to identify proteins that are abundantly 
expressed on certain human tumor tissue(s) and that are either (i) not 
expressed, or (ii) expressed at detectably lower levels, on normal tissue(s). 

4. In the course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor tissue 
at significantly higher levels than in normal human tissue. To date, we 
have successfully generated antibodies that bind to 31 of the tumor antigen 
proteins expressed from these differentially expressed gene transcripts and 
have used these antibodies to quantitatively determine the level of 
production of these tumor antigen proteins in both human tumor tissue and 
normal tissue. We have then quantitatively compared the levels of mRNA 
and protein in both the tumor and normal tissues analyzed. The results of 
these analyses are attached herewith as Exhibit B, In Exhibit B, **+" means 
that the mRNA or protein was detectably overexpressed in the tumor tissue 
relative to normal tissue and means that no detectable overexpression 
was observed in the tumor tissue relative to normal tissue. 

5. As shown in Exhibit B, of the 3 1 genes identified as being detectably 
overexpressed in human tumor tissue as compared to normal human tissue 
at the mRNA level , 28 of them (Le., greater than 90%) are also detectably 
overexpressed in human tumor tissue as compared to normal human tissue 
at the protein level . As such, in the cases where we have been able to 
quantitatively measure both (i) mRNA and (ii) protein levels in both (i) 
tumor tissue and (ii) normal tissue, we have observed that in the vast 
majority of cases, there is a very strong correlation between increases in 
mRNA expression and increases in the level of protein encoded by that 
mRNA. 



6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4-5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRN A m a cumor 
tissue relative to a normal tissue more often than not correlates to a similar 
increase in abundance of the encoded protein in the tumor tissue relative 10 
the normal tissue. In fact, it remains a generally accepted working 
assumption in molecular biology that increased mRNA levels are more 
often than not predictive of elevated levels of ihe encoded protein. In fact, 
an entire industry focusing on the research and development of therapeutic 
antibodies to treat a variety of human diseases, such as cancer, operates on 
this working assumption. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be 
true, and further that these statements were made with the knowledge that 
willful fals e statements and th^likc so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Tide J 8 of the United States 
Code- and that such willful statements may jeopardize the validity of the 
application or any patent issued thereon. 





* 



Paul Polakis, Ph.D. 
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EXHIBIT A 



DECLARATION OF PAUL POLAKIS, Ph.D. 
I, Paul Polalds, Ph.D., declare and say as follows: 

1 . I was awarded a Ph.D. by the Department of Biochemistry of the Michigan 
State University in 1984. My scientific Curriculum Vitae is attached to and forms 
part of this Declaration (Exhibit A). 

2. I am currently employed by Genentech, Inc. where my job title is Staff 
Scientist. Since joining Genentech in 1999, one of my primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research project 
with a primary focus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3. As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells. 
The purpose of this research is to identify proteins that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins". When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein. 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4. In the course of the research conducted by Genentech's Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in human tumor cells relative to normal cells, 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used technique of microarray analysis 
which has proven to be extremely useful for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another. Iii the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corresponding normal human cells. To date, we 
have generated antibodies that bind to about 30 of the tumor antigen proteins 
expressed trom tftese differentially expres sed gene transcripts and have used the s e 
antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells. We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5. From the mRNA and protpin expression analyses described in paragraphs 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 
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expressed from that mRNA in that cell type. In approximately 80% of our 
observations we have found that increases in the level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells, 

6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRNA in a tumor cell relative 
to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood genera] rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein, 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
statements may jeopardize the validity of the application or any patent issued 
thereon. 



Dated : SAM Bv: fy/ 

PauJPoiakis,PhJD. 
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SECOND DECLARATION OF PAUL FOLAKIS. Ph.D. 
I, Paul Polakis, Ph.D.. declare and say as follows: 

1 . I am currently employed by Genentech, inc. where my job title is Staff 
Scientist. 

2. Since joining Genentech in 1999, one of my primary responsibilities has 
been leading Geneniech's Tumor Antigen Project, which is a large research 
project with a primary focus on identifying tumor cell markers rhar find use 
as targets for both the diagnosis and treatment of cancer in humans. 

3. As I stated in ray previous Declaration dated May 7, 2004 (attached as 
Exhibit A), ray laboratory has been employing a variety of techniques, 
including microarray analysis, to identify genes which are differentially 
expressed in human tumor tissue relative to normal human tissue. The 
primary purpose of this research is to identify proteins that are abundantly 
expressed on certain human tumor rissue(s) and that are either (i) not 
expressed, or (ii) expressed at detectably lower levels, on normal tissue(s). 

4. In the course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human rumor tissue 
at significantly higher levels than in normal human tissue. To date, we 
have successfully generated antibodies that bind to 31 of the tumor antigen 
proteins expressed from these differentially expressed gene transcripts and 
have used these antibodies to quantitatively determine the level of 
production of these tumor antigen proteins in both human tumor tissue and 
normal tissue. We have then quantitatively compared the levejs of" mRNA 
and protein in both the tumor and normal tissues analyzed. The results of 
these analyses are attached herewith as Exhibit B. In Exhibit B, **+" means 
that the mRNA or protein was detectably overexpressed in the tumor tissue 
relative to normal tissue and means thai no detectable overexpression 
was observed in the tumor tissue relative to normal tissue. 

5. As shown in Exhibit B, of the 3 1 genes identified as being detectably 
overexpressed in human tumor tissue as compared to normal human tissue 
at the mRNA level . 28 of them (i.e., greater than 90%) are also detectably 
overexpressed in human tumor tissue as compared to normal human tissue 
at the protein level . As such, in the cases where we have been able to 
quantitatively measure both (i) mRNA and (ii) protein levels in both (i) 
tumor tissue and (ii) normal tissue, we have observed thai in the vast 
majority of cases, there is a very strong correlation between increases in 
raRNA expression and increases in the level of protein encoded by that 
mRNA. 



6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4-5 above and my 
knowledge of rhe relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRN A in a tumor 
tissue relative to a normal tissue more often than not correlates to a similar 
increase in abundance of the encoded protein in the tumor tissue relative to 
the normal tissue. In fact, it remains a generally accepted working 
assumption in molecular biology that increased mRNA levels are more 
often than not predictive of elevaied levels of the encoded protein. In fact, 
an entire industry focusing on the research and development of therapeutic 
antibodies to treat a variety of human diseases, such as cancer, operates on 
this working assumption, 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be 
true, and further thai these statements were made with the knowledge that 
willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code and that such willful statements may jeopardize the validity of the 
application or any patent issued thereon. 





Paul Polakis, Ph.D. 
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EXHIBIT A 

DECLARATION OF PAUL POLAKJS, Ph,D. 
I, Paul Polakis, Ph.D., declare and say as follows: 

1 . I was awarded a Ph.D. by the Department of Biochemistry of the Michigan 
State University in 1984. My scientific Curriculum Vitae is attached to and forms 
part of this Declaration (Exhibit A). 

2. I am currently employed by Genentech, Inc. where ray job tide is Staff 
Scientist. Since joining Genentech in 1999, one of my primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research project 
with a primary focus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3. As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells. 
The purpose of this research is to identify proteins that are abundantly expressed 
on certain tumor cells and that are either (0 not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins". When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein. 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4. In the course of the research conducted by Genentech's Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in human tumor cells relative to normal cells, 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used technique of microarray analysis 
which has proven to be extremely usefiil for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another. In the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corresponding normal human cells. To date, we 
have generated antibodies that bind to about 30 of the tumor antigen proteins 
expressed trom these ditierenuaJly expressed gene Uausciipts and liav c us e d thes e — 
antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells. We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5. From the mRNA and protein expression analyses described in paragraph 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 
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expressed from that mRNA in that cell type. In approximately 80% of our 
observations we have found that increases in the level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRNA in a tumor cell relative 
to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment* or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
statements may jeopardize the validity of the application or any patent issued 
thereon. 



By : fy</4M< 



Dated: 5/oVoY 

PauIPoiakis,PhJD. 
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Tbe DEAD box gene, DDXl, is a putative RNA helicase 
that is co-amplified with MYCN in a subset of retinoblas- 
toma (KB) and neuroblastoma (NB) tumors and cell 
lines. Although gene amplification usually involves hun- 
dreds to thousands of kilobase pairs of DNA, a number of 
studies' suggest that co-amplified genes are only overex- 
pressed if they provide a selective advantage to the cells 
in which they are amplified. Here, we further character- 
ize DDJX1 by identifying its putative transcription and 
translation initiation sites. We analyze DDX1 protein 
levels in MYCN/DDX1 -amplified NB and RB cell lines 
using polyclonal antibodies specific to DDXl and show 
that there is a good correlation with DDXl gene copy 
number, DDXl transcript levels, and DDXl protein lev- 
els in all cell lines studied. DDXl protein is found in both 
the nucleus and cytoplasm of DDXl -amplified lines but 
is localized primarily to the nucleus of nonamplified 
cells. Oxir results indicate that DDXl may be involved in 
either the formation or progression of a subset of NB 
and RB tumors and suggest that DDXl normally plays a 
role in the metabolism of RNAs located in the nucleus of 
the cell. 



DEAD box proteins are a family of putative RNA helicases 
that are characterized by eight conserved amino acid motifs, 
one of which is the ATP hydrolysis motif containing the core 
amino acid sequence DEAD (Asp-Glu-Ala-Asp) (1-3). Over 40 
members of the DEAD box family have been isolated from a 
variety of organisms including bacteria, yeast, insects, amphib- 
ians, mammals, and plants. The prototypic DEAD box protein 
is the translation initiation factor, eukaryotic initiation factor 
4A, which, when combined with eukaryotic initiation factor 4B, 
unwinds double-stranded RNA (4). Other DEAD box proteins, 
such as p68, Vasa, and An3, can effectively and independently 
destabilize/unwind short RNA duplexes in vitro (5-7). Al- 
though some DEAD box proteins play general roles in cellular 
processes such as translation initiation (eukaryotic initiation 
factor 4A (4)), RNA splicing (PRP5, PRP28, and SPP81 in yeast 
(8-10)), and ribosomal assembly (SrmB in Escherichia coli 
(11)), the function of most DEAD box proteins remains un- 
known. Many of the DEAD box proteins found in higher eu- 
karyotes are tissue- or stage-specific. For example, PL 10 
mRNA is expressed only in the male germ line, and its product 
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has been proposed to have a specific role in translational reg- 
ulation during spermatogenesis (12). Vasa and ME31B are 
maternal proteins that may be involved in embryogenesis (13, 
14). p68, found in dividing cells (15), is believed to be required 
for the formation of nucleoli and may also have a function in 
the regulation of cell growth and division (16, 17). Other DEAD 
box proteins are implicated in RNA degradation, mRNA stabil- 
ity, and RNA editing (18-20). 

The human DEAD box protein gene DDXl 1 was identified by 
differential screening of a cDNA library enriched in transcripts 
present in the two EE cell lines Y79 and RB522A (21). The 
longest DDXl cDNA insert isolated from this library was 2.4 kb 
with an open reading frame from position 1 to 2201. All eight 
conserved motifs, characteristic of DEAD box proteins are found 
in the predicted amino acid sequence of DDXl as well as a 
region with homology to the heterogeneous nuclear ribonucle- 
oprotein U, a protein believed to participate in the processing of 
heterogeneous nuclear RNA to mRNA (22, 23). The region of 
homology to heterogeneous nuclear ribonucleoprotein U spans 
128 amino acids and is located between the first two conserved 
DEAD box protein motifs, la and lb. 

The proto-oncogene MYCN encodes a member of the MYC 
family of transcription factors that bind to an E box element 
(CACGTG) when dimerized with the MAX protein (24, 25). The 
MYCN gene is amplified and overexpressed in approximately 
one-third of all NB tumors (26, 27). Amplification of MYCN is 
associated with rapid tumor progression and a poor clinical 
prognosis (26, 27). MYCN overexpression is usually achieved 
by increasing gene copy number rather than by up-regulating 
basal expression of MYGN (27, 28). Because gene amplification 
involves hundreds to thousands of kilobase pairs of contiguous 
DNA (29-32), it is possible that co-amplification of a gene 
located in proximity to MYCN may contribute to the poor 
clinical prognosis of MYCAT-amphfied tumors. The DDXl gene 
maps to the same chromosomal band as MYCN, 2p24, and is 
located -400 kb telomeric to the MYCN gene (33-36). All four 
MYCA^amplified RB tumor cell lines tested to date are ampli- 
fied for DDXl (21), 2 while approximately two-thirds of NB cell 
lines and 38-68% of NB tumors are co-amplified for both genes 
(37-39). George et al. (39) found a significant decrease in the 
mean disease-free survival of patients with DDXl I MYCN-am- 
plified NB tumors compared with MYCN- amplified tumors. 
Similarly, Squire et al. (38) observed a trend toward a worse 
clinical prognosis when both genes were amplified in the tu- 
mors of NB patients. To date, there have been no reports of a 



1 The abbreviations used are: DDXl, DEAD box 1; NB, neuroblas- 
toma; RB, retinoblastoma; RACE, rapid amplification of cDNA ends; 
PAGE, polyacrylamide gel electrophoresis; nt, nucleotide^); MOPs] 
4-morpholinepropanesulfonic acid; bp, base pair(s); kb, kilobase(s) or 
kilobase pair(s). 

2 R. Godbout, unpublished results. 
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tumor amplified only for DDXl, and the role that this gene 
plays in cancer formation and progression is not known. 

Because of the high rate of rearrangements in amplified 
DNA (31, 40), it is unlikely that a gene located -400 kb from 
the MYCN gene will be consistently amplified as an intact unit 
unless its product provides a growth advantage to the cell. 
Based on Southern blot analysis, the DDXl gene extends over 
more than 30 kb, and there are no gross rearrangements of this 
gene in DDXl -amplified tumors (21, 38). Furthermore, there is 
a good correlation between DDXl transcript levels and gene 
copy number in the tumors analyzed to date. However, we need 
to show that DDX1 protein is overexpressed in DDX1 -amplified 
tumors if we are to entertain the possibility that this protein 
plays a role in the tumorigenic process. Here, we isolate and 
characterize the 5 '-end of DDX1 mKNA and extend the DDX1 
cDNA sequence by —300 nt. We identify the predicted initia- 
tion codon of DDX1 and generate antisera that specifically 
recognize DDX1 protein. We analyze levels of DDX1 protein in 
both DDX1 -amplified and nonamplified RB and NB tumors and 
study trie subcellular location of this protein in the cell. 

MATERIALS AND METHODS 

Library Screening — A human fetal brain cDNA library (Stratagene) 
was screened using a 320-bp DNA fragment from the 5 '-end of the 
2.4-kb DDX1 cDNA previously described (23). Phagemids containing 
positive inserts were excised from A. ZAP II following the supplier's 
directions. The ends of the cDNA inserts were sequenced using the 
dideoxynoicleotide chain termination method with T7 DNA polymerase 
(Amersham Pharmacia Biotech). 

A human placenta genomic library (CLONTECH) was screened with 
the 5 '-end of DDX1 cDNA. Positive plaques were purified, and the 
genomic DNA was analyzed using restriction enzymes and Southern 
blotting. i£coRI-digested DNA fragments from these clones were sub- 
cloned into pBluescript and digested with exonuclease III and mung 
bean nuclease to obtain sequentially deleted clones. The exon/intron 
map of the 5' portion of the DDX1 gene was obtained by comparing the 
sequence of DDXl cDNA with that of the genomic DNA. 

Rapid Amplification of cDNA Ends (RACE)— We used the Ampli- 
FMDER RACE kit (CLONTECH) to extend the 5'-end of DDXl cDNA. 
Briefly, two /ig of poly(A) + RNA isolated from RB522A was reverse 
transcribed at 52 °C using either primer PI or P3 (Pig. 1A). The RNA 
template was hydrolyzed, and excess primer was removed. A single- 
stranded AmpliFINDER anchor containing an EcoKl site was ligated to 
the 3 '-end of the cDNA using T4 RNA ligase. The cDNA was amplified 
using either primer P2 or P4 (Fig. 1A) and AmpliFINDER anchor 
primer. RACE products were cloned into pBluescript. 

Primer Extension — Poly(A) + RNAs were isolated from RB and NB 
cell lines as described previously (21, 38). The 21-nt primers 5'-TTCGT- 
TCTGGGCACCATGTGT-3 ' (primer P4 in Fig. 1A) and 5'-TGGGAC- 
CTAGGGCTTCTGGAC-3 ' (primer P3 in Fig. 1A) were end-labeled with 
[y- 32 P]ATP (3000 Ci/mmol; Mandel Scientific) and T4 polynucleotide 
kinase. Each of the labeled primers was annealed to 2 jug of poly(A) + 
RNA at 45 °C for 90 min, and the cDNA was extended at 42 °C for 60 
min using avian myeloblastosis virus reverse transcriptase (Promega). 
The primer extension products were heat-denatured and run on a 8% 
polyacrylamide gel containing 7 M urea in IX TBE buffer. A G + A 
sequencing ladder served as the size standard. 

SI Nuclease Protection Assay — The Si nuclease protection assay to 
map the transcription initiation site of DDXl was performed as de- 
scribed by Favaloro et al. (41). The DNA probe was prepared by digest- 
ing genomic DNA spanning the upstream region of DDXl and exon 1 
with Aval, labeling the ends with [y- 32 P]ATP (3000 Ci/mmol) and 
polynucleotide kinase, and removing the label from one of the ends by 
digesting the DNA with Sphl (Fig. 4). The RNA samples were res us- 
pended in a hybridization mixture containing 80% formamide, 40 mM 
PIPES, 400 mM NaCl, 1 mM EDTA, and the heat-denatured Sphl-Aval 
probe labeled at the Aual site. The samples were incubated at 45 °C for 
16 h and digested with 3000 units/ml SI nuclease (Boehringer Mann- 
heim) for 60 min at 37 *C. The samples were precipitated with ethanol; 
resuspended in 80% formaldehyde, TBE buffer, 0.1% bromphenol blue, 
xylene cyanol; denatured at 90 °C for 2 min; and electrophoresed in a 7 
M urea, 8% polyacrylamide gel in TBE buffer. 

Northern and Southern Blot Analysis — Poly(A) + RNAs were isolated 
from RB and NB cell lines as described previously (21, 38). Two /xg of 



poly(A) + RNA/lane were electrophoresed in a 6% formaldehyde, 1.5% 
agarose gel in MOPS buffer (20 mM MOPS, 5 mM sodium acetate, 1 mM 
EDTA, pH 7.0) and transferred to nitrocellulose filter in 3 M sodium 
chloride, 0.3 M sodium citrate. The filters were hybridized to the follow- 
ing DNA probes, 32 P-labeled by nick translation: (i) a 1.6-kb EcoBI 
insert from DDXl cDNA clone 1042 (21), (ii) a 260-bp cDNA fragment 
spanning the 3' -end of DDXl exon 1 as well as exons 2 and 3, (iii) a 
160-bp fragment derived from the 5 '-end of DDXl exon 1, and (iv) 
a-actin cDNA to control for lane to lane variation in RNA levels. Filters 
were hybridized and washed under high stringency. Southern blot 
analysis was as described previously (21). 

Preparation ofAnti-DDXl Antiserum— To prepare antiserum to the 
C terminus of the DDXl protein, we inserted a 1.8-kb Eco RI fragment 
from bp 848 to 2668 of DDXl cDNA (Fig. IB) into EcoRI-digested 
pMAL-c2 expression vector (New England Biolabs). DH5a cells trans- 
formed with this vector were grown to mid-log phase and induced with 
0.1 mM isopropyl-l-thio-^-D-thiogalactoside. The cells were harvested 

3- 4 h postinduction and lysed by sonication. Soluble maltose binding 
protein-DDXl fusion protein was affinity-purified using amylose resin, 
and the maltose-binding protein was cleaved with factor Xa. The DDXl 
protein was purified on a SDS-PAGE gel, electroeluted, and concen- 
trated. Approximately 100 yjg of protein was injected into rabbits at 

4- 6-week intervals. For the initial injection, the protein was dispersed 
in complete Freund's adjuvant (Sigma), while subsequent injections 
were prepared in Freund's incomplete adjuvant. Blood was collected 
from each rabbit 10 days after injection, and the specificity of the 
antiserum was tested using cell extracts from RB522A. To prepare 
antiserum to the N terminus of DDXl protein, a DDXl cDNA fragment 
from bp 268 to 851 (Fig. IB) was inserted into pGEX-4T2 (Amersham 
Pharmacia Biotech). The recombinant protein produced from this con- 
struct contains the first 186 amino acids of the predicted DDXl se- 
quence. Soluble glutathione 5-transferase-DDXl fusion protein was 
purified with glutathione-Sepharose 4B (Amersham Pharmacia Bio- 
tech). The glutathione S-transferase component of the fusion protein 
was cleaved with thrombin. 

Subcellular Fractionations and Western Blot Analysis — We used two 
different procedures for subcellular fractionations. First, we isolated 
nuclear and S100 (soluble cytoplasmic) fractions from RB522A, IMR-32, 
Y79, RB(E)-2, HeLa, and HL60 using the procedure of Dignam (42). On 
average, we obtained 5-6 times more protein in the cytosolic fractions 
than in the nuclear fractions. Second, 10 s RB522A cells were lysed and 
fractionated into S4 (soluble cytoplasmic components), P2 (heavy mito- 
chondria, plasma membrane fragments), P3 (mitochondria, lysozymes, 
peroxisomes, and Golgi membranes), and P4 fractions (membrane ves- 
icles from rough and smooth endoplasmic reticulum, Golgi, and plasma 
membrane) by differential centrifugation (43). We obtained 8 mg of 
protein in the S4 fraction, 1 mg in P2, 0.5 mg in P3, and 2 mg in P4 
fraction. The procedures related to the immunoelectron microscopy 
have been previously described (44). 

For Western blot analysis, proteins were electrophoresed in poly- 
acrylamide-SDS gels and electroblotted onto nitrocellulose using the 
standard protocol for protein transfer described by SchleicheT and^ 
Schuell. The filters were incubated with a 1:5000 dilution of DDXl 
antiserum, a 1:200 dilution of anti-MYCN monoclonal antibody (Boeh- 
ringer Mannheim), or a 1:200 dilution of anti-actin (Santa Cruz Bio- 
technology, Inc., Santa Cruz, CA). For the colorimetric analysis, anti- 
gen-antibody interactions were visualized using either alkaline 
phosphatase-linked goat anti-rabbit IgG (for DDXl) or goat anti-mouse 
IgG (for MYCN) at a 1:3000 dilution. For the ECL Western blotting 
analysis (Amersham Pharmacia Biotech), we used a 1:100,000 dilution 
of peroxidase-linked secondary anti-rabbit IgG antibody (for DDXl) or 
secondary anti-goat IgG antibody (Jackson ImmunoResearch 
Laboratories). 

RESULTS 

Identification of the 5'-End of the DDXl Transcript We 

have previously reported the sequence of DDXl cDNA isolated 
from an RB cDNA library (21, 23). This 2.4-kb DDXl cDNA 
contains an open reading frame spanning positions 1-2201 
with a methionine encoded by the first three nucleotides (Fig. 
1A). There is a polyadenylation signal and poly(A) tail in the 
3 '-untranslated region, indicating that the sequence is com- 
plete at the 3'-end. Manohar et al. (37) have also isolated DDXl 
cDNA from the NB cell line LA-N-5. Their cDNA extended the 
5 '-end of our sequence by 42 bp and included an additional in 
frame methionine (double underlined in Fig. 1A). The possibil- 
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Fig. 1. Partial sequence and structure of DDXl cDNA. A, the 
sequence of the 5 '-end of DDXl cDNA. The sequence in boldface type 
starting at the asterisk was obtained using the RACE strategy. The 
additional 6 bp in italic boldface type at the 5 '-end of the cDNA are 
predicted based on the known DDXl genomic sequence and primer 
extension analysis. PI, P2, P3, and P4 are primers used in the RACE 
experiments (the complementary sequence was used in each case). 
Primers P3 and P4 were also used for the primer extension analysis. 
Three in frame methionine codons are indicated by the double under- 
line. An in frame stop codon is indicated by the boldface double under- 
line. The three major transcription initiation sites identified by primer 
extension are indicated by the single arrows, while a minor site is 
represented by the broad arrow. The predicted DDXl transcription 
initiation sites obtained by RACE, SI nuclease, and primer extension 
are indicated as well as the 5 '-ends of DDXl cDNA sequences obtained 
by screening cDNA libraries. The sequences transcribed from exons 1, 
2. and 3 are also shown. B i the structure of the 2711-bp DDXl cDNA is 
shown with an open reading frame from position 295 to 2515. 

ity of additional in frame methionines located further upstream 
could not be excluded, because there were no predicted stop 
codons in the upstream region of the cDNA. 

Northern blot analysis indicated a DDXl transcript size of 
-2800 nt, suggesting that the DDXl cDNAs isolated to date 
were lacking ~300— 350 bp of 5' sequence. We have used dif- 
ferent approaches to identify the transcription start site of 
DDXl. First, we exhaustively screened a commercial fetal 
brain cDNA library with the 5 '-end of DDXl cDNA. Although 
numerous clones were analyzed, only one extended the se- 
quence (by 35 bp) beyond that published by Manohar et al (37) 
(Fig. 1A). 

We next used the RACE procedure in an attempt to isolate 
additional 5' sequence. The nested primers used to amplify the 
5 '-end of the DDXl transcript are labeled as primers PI and P2 
in Fig. 1A and are located downstream of the three in frame 
methionines (double underlined in Fig. 1A). Poly(A) + RNA 
from RB522A was reverse transcribed at 52 °C using primer 
PI, and the reverse transcribed cDNA was amplified using the 
nested primer P2 and the 5 '-RACE primer. Using this ap- 
proach, we generated a product that was 230 bp longer than 
any of the cDNAs obtained by screening libraries (Fig. 1A). 
Sequencing of this 230-bp cDNA revealed an in frame stop 
codon (boldface double underline in Fig. LA) located 123 bp 




Fig. 2. Identification of the 5 -end of the DDXl transcript by 
primer extension. Radioactively labeled primer P4 was annealed to 2 
of polyCAT RNA from RB522A {lane 2), 1 of poly(A) + RNA from 
RB522A {lane 2) t and 2 ^g of poly(A) + RNA from RB(E)-2 cells {lane 3), 
and extended using reverse transcriptase. The products were run on an 
8% denaturing polyacrylamide gel with a G + A sequencing ladder as 
size marker. The primer extension products are indicated on the left. 
The sizes of the products (in nt) are presented as the distance from 
primer P4. 

upstream of the predicted translation initiation site. We then 
prepared primers P3 and P4, located near the 5'-end of the 
RACE cDNA (Fig. 1A) and repeated the RACE procedure to see 
if additional 5' sequences could be obtained. The resulting 
RACE products did not extend the DDXl cDNA sequence 
further. 

The location of the DDXl transcription initiation site was 
verified by primer extension. Poly(A) + RNA was prepared from 
the following two cell lines: DDXi-amplified RB cell line 
RB522A and a nonamplified RB cell line RB(E)-2. RB522A has 
elevated levels of DDXl mRNA, while RB(E)-2 has at least 
20-fold lower levels of this transcript. Three products of 40, 43, 
and 46 nt (with a weak signal at 45 nt) were detected in 
RB522A using primer P4 (Figs. 1A and 2). The 40-nt product 
corresponded exactly with the 5'-end of the RACE-derived 
cDNA while the 43- and 46-nt products extended the predicted 
size of the DDXl transcript by 3 and 6 nt, respectively. None of 
these products were observed in RB(E)-2. Bands of identical 
sizes to those obtained with RB522A mRNA were also observed 
in the DDXl -amplified NB cell line BE(2)-C but not in the 
DDXl-amplified NB cell line IMR-32 (data not shown). The 
same predicted DDXl transcription initiation site was identi- 
fied with primer P3 except that the bands were of weaker 
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Fig. 3. Genomic map of the 5' -end of DDXl. The exons are represented by the black boxes, and distances are in kilobase pairs. The locations 
of EcoBl (E) sites are indicated. 



intensity (data not shown). We have designated the transcrip- 
tion start site identified by primer extension as + 1 (Fig. LA). 

The sequence of the 6 nt extending beyond the RACE cDNA 
was obtained by comparison of the cDNA sequence with that of 
DDXl genomic DNA. Bacteriophages containing DDXl 
genomic DNA were isolated by screening a human placenta 
library with 5' DDXl cDNA. Eighteen kb of DNA were se- 
quenced from two bacteriophages with overlapping DDX1 
genomic DNA. Thirteen exons were identified within this 18-kb 
region (Fig. 3) corresponding to cDNA sequences from position 

1 to 1249. The 310-bp exon 1 was by far the longest of the 13 
exons sequenced, corresponding to the entire 5 '-un translated 
region of DDX1 as well as the first in frame methionine. The 
sequences transcribed from exons 1, 2, and 3 are indicated in 
Fig. LA. 

Knowledge of the genomic structure of DDXl allowed us to 
use the S 1 protection assay, a technique that is independent of 
reverse transcriptase, to further define the 5-' -end of the- DDXl' 
transcript. Poly(A) + RNAs from six DDX1 -amplified lines (RB 
lines: Y79 and KB522A; NB lines: BE(2)-C, IMR-32, LA-N-1, 
and LA-N-5) and six nonamplifled lines (RB lines: RB(E)-2 and 
RB412; NB lines, GOTO, NB-1, NUB-7, and SK-N-MC) were 
hybridized to a DNA probe that extended from position -745 in 
the S'-flanking DDX1 DNA to position +164 in exon 1. This 
DNA probe was labeled at position +164 as indicated in Fig. 4. 
Nonhybridized DNA was digested with SI nuclease, and the 
sizes of the protected fragments were analyzed on a denaturing 
polyacrylamide gel. Bands of 150-153 nt were observed in lane 

2 (RB522A), lane 5 (BE(2)-C), and lane 8 (LA-N-1) with bands 
of much weaker intensity in lane 7 (IMR-32) (Fig. 4). Specific 
bands were not detected in either DDX1 -amplified Y79 and 
LA-N-5 or the nonamplified lines. Although the sizes of the SI 
protected bands in RB522A, BE(2)-C, and LA-N-1 were 5 and 
11 nt shorter than predicted based on RACE and primer ex- 
tensions, respectively, there was general agreement with all 
three techniques regarding the location of the DDXl transcrip- 
tion initiation site (Fig. LA). The smaller SI nuclease protected 
products could have arisen as the result of SI digestion of the 
5 '-end of the RNAjDNA heteroduplex because of its relatively 
high rU:dA content (45). 

Identification of the same transcription initiation site in 
three DDX1 -amplified lines suggests that this represents the 
bona fide start site of DDX1 transcription. However, it was not 
clear why this start site was either very weak or not detected in 
three other amplified lines. To determine whether the 5' -end of 
exon 1 is transcribed in all DDX1 -amplified lines, we carried 
out a direct analysis of the 5 '-end of the DDX1 transcript by 
Northern blotting. Two probes were used for this analysis: the 
5' probe contained a 160-bp fragment from bp 1 to 160 (5 '-half 
of exon 1), and the 3' probe contained a 260-bp fragment from 
bp 160 to 420 (3'-half of exon 1 as well exons 2 and 3) (Fig. 1A). 
With the 3' probe, we obtained bands of similar size and inten- 
sity in four DDX1 -amplified lines (RB522A, BE(2)-C, IMR-32, 
and LA-N-5). Band intensity was somewhat weaker in Y79 and 
stronger in LA-N-1 in comparison with the other lines (Fig. 5). 
No signal was detected in the non-DDXZ -amplified line RB412, 
With the 5' probe, a relatively strong signal was observed in 
RB522A, BE(2)-C, and LA-N-1, while a considerably weaker 



but readily apparent signal was detected in Y79, IMR-32, and 
LA-N-5. The signal obtained with actin indicates that, with the 
exception of LA-N-1, similar amounts of RNA were loaded in 
each lane and that the RNA was not degraded. These results 
indicate that at least a portion of the 160-bp 5 '-end of exon 1 is 
transcribed in all ZXDXZ -amplified lines. 

Based on primer extension, SI nuclease protection assay, 
Northern blot analysis and the sequencing of the RACE prod- 
ucts, we conclude that the DDX1 transcript is 2.7 kb with an 
open reading frame spanning nucleotides 295-2515 encoding a 
predicted protein of 740 amino acids with an estimated molec- 
ular weight of 82.4 (Fig. IB). An in frame stop codon is located 
123 nt upstream of the predicted translation initiation site, at 
positions 172-174. The first in frame methionine following the 
stop codon is in agreement with the Kozak consensus sequence 
(46). Furthermore, the predicted start methionine codon for 
human DDX1 corresponds perfectly with that of Drosophila 
DDXl (47). A stop codon is located 15 nt upstream of the 
initiation codon in Drosophila DDXl. 

Analysis of DDXl Protein Levels in Neuroblastoma and Ret- 
inoblastoma — We and others have previously shown that there 
is a good correlation between gene copy number and RNA levels 
mDDXi -amplified RB and NB cell lines (37, 38). To determine 
whether the correlation extends to DDXl protein levels, we 
prepared antiserum to two nonoverlapping recombinant DDXl 
proteins. First, we prepared a C terminus recombinant protein 
construct by inserting a 1.8-kb EcoRl fragment from bp 848 to 
2668 (amino acids 185-740) (Fig. LB) into the pMAL-c2 expres- 
sion vector. Recombinant protein expression was induced with 
isopropyl-l-thio-)3-D-thiogalactoside, and the 110-kDa maltose- 
binding protein-DDXl fusion product was purified by affinity 
chromatography using axnylose resin, followed by electrophore- 
sis on a SDS-PAGE gel after cleaving the maltose-binding 
protein fusion partner with factor Xa. Second, we prepared an 
N terminus construct by ligating a DNA fragment from bp 268 
to 851 (amino acids 1-186) into pGEX-4T2, The 50-kDa gluta- 
thione iS-transferase-DDXl fusion protein was purified by af- 
finity chromatography on a glutathione column. This N termi- 
nus fusion protein contains only the first of the eight conserved 
motifs found in all DEAD box proteins, while the C terminus 
fusion protein includes the remaining seven motifs. 

We measured DDXl protein levels in total cell extracts of 
three RB and 10 NB cell lines. Using antiserum to the N 
terminus fusion protein, we observed a strong signal in all 
DDXl -amplified cell lines: the RB cell lines Y79 (lane 1) and 
RB522A (lane 2) and the NB cell lines BE(2)-C (lane 4), IMR-32 
(lane 6), LA-N-1 (lane S), and LA-N-5 (lane 9) (Fig. 6). Two 
bands were observed in the majority of extracts. Of the ampli- 
fied lines, Y79 produced the weakest signal, with the most 
intense signal observed in LA-N-1. There was an excellent 
correlation with DDXl protein and mRNA levels in these cell 
lines, with lower levels of DDXl mRNA observed in Y79 and 
higher levels in LA-N-1 (Fig. 7A). As shown in Fig. 7B, this 
correlation extended to DDXl gene copy number. No gross 
DNA rearrangements were seen in the DDXl -amplified lines; 
however, three small bands of altered size were observed in the 
RB412 lane. Although the nature of the DNA alteration is not 
known, it is noteworthy that DDXl transcript levels in RB412 
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Fig. 4. Si nuclease mapping of the 
5 '-end of the DDXl transcript. Two /ig 

of poly(A) + RNA from four RB lines 
{DDXl- amplified Y79 and RB522A and 
nonamplified RB(E)-2 and RB412), eight 
NB lines CDOTi-amplified BE(2)-C, IMR- 
32, LA-N-1, and LA-N-5 and nonamplified 
GOTO, NB-1, NUB-7, and SK-N-MC), 
and tRNA as a negative control were hy- 
bridized to a Sphl-Aval fragment labeled 
at the Aval site with [y- 32 P]ATP and 
polynucleotide kinase. Bands of 150-153 
nt are shown in lanes 2 (RB522A), 5 
(BE(2)-C), and 5 (LA-N-1) with much 
weaker bands in lane 7 (IMR-32). A map 
of the probe indicating the transcription 
initiation site identified by primer exten- 
si6h"(+2), the labeling site (*), and exons 
1 and 2, is shown at the bottom. 
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are extremely low (Fig. 7A) and that the top DDXl protein band 
in RB412 cell extracts is smaller in size than the top band from 
the other cell extracts (Fig. 6). 

Two DDX1 protein bands were present in most of the lanes in 
Fig. 6. The same two bands were detected with antiserum to 
the C terminus of the DDX1 protein, as well as a third band at 
—60 kDa (data not shown). There was no variation in the 
intensity of the 60-kDa band in DDXl -amplified and nonam- 
plified cell extracts. The 60-kDa band probably represents an- 
other member of the DEAD box protein family, because the C 
terminus DDX1 protein used to prepare this antiserum con- 
tained seven of the eight conserved motifs found in all DEAD 
box proteins. To obtain an estimate of the size of the two DDX1 
bands, we ran cellular extracts from RB522A on a 7% SDS- 
PAGE gel with the BenchMark protein ladder (Life Technolo- 
gies, Inc.). The size of the DDX1 protein was determined using 
the Alpha Imager 2000 documentation and analysis system for 
molecular weight calculation. Based on this analysis, the esti- 
mated molecular mass of the top band is 89.5 kDa, while that 
of the bottom band is 83.5 kDa. The 84-kDa band may repre- 
sent the unmodified product encoded by the DDXl transcript 
(capable of encoding a protein with a predicted molecular mass 
of 82.4 kDa), while the top band may represent post-transla- 
tional modification of DDXl protein (e.g. phosphorylation). An- 
other possibility is that the top band represents intact DDXl 




Fig. 5. Northern blot analysis of the 5'-end of the DDXl tran- 
script. Two ng of poly(A) + RNA isolated from Z)Z>X7-amplined Y79 
EB522A, BE(2)-C, IMR.32, LA-N-1, and LA-N-5 and nonamplified 
RB412 were electrophoresed in a 1.5% agarose-formaldehyde gel. The 
RNA was transferred to a nitrocellulose filter and sequentially hybrid- 
ized with a 260-bp fragment from DDXl cDNA from bp +160 to +420 
(3'-end of exon 1 as well as exons 2 and 3) CA), a 160-bp fragment from 
DDXl cDNA from bp + 1 to + 160 (5'-end of exon 1) (B), and actin cDNA 
(C). The DNA was labeled with P^JdCTP by nick translation. The blots 
were hybridized and washed under high stringency. 
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Fig. 6. DDXl protein expression in KB and NB cell lines. West- 
ern blots were prepared using total cellular extracts from three RB 
(Y79, RB522A, and RB412) and 10 NB cell lines (BE(2)-C GOTO 
IMR-32, KAN, LA-N-1, LA-N-5, NB-1, NUB-7, SK-N-MC, and SK-N- 
SH). The lines that are amplified for the DDXl gene are Y79, RB522A, 
BE(2)-C, IMR-32, LA-N-1, and LA-N-5. Twenty yjg of protein were 
loaded in each lane and electrophoresed in a 10% SDS-PAGE gel. DDXl 
was detected using a 1:5000 dilution of the antiserum to the amino 
terminus of DDXl protein. Size markers in kilodaltons are indicated on 
the side. 




Fig. 7. Northern and Southern blot analyses of DDXl in RB 
and NB cell lines. A, 2 ^g of poly(A)* RNA were loaded in each lane, 
electrophoresed in a 1.5% agarose-formaldehyde gel, and transferred to 
a nitrocellulose filter. The filter was first hybridized to a 32 P-labeled 
1.6-kb DDXl cDNA (clone 1042) (21), stripped, and rehybridized to 
actin DNA. B, 10 jxg of genomic DNA from each of the indicated cell 
lines were digested with EcoBl, electrophoresed in a 1% agarose gel, 
and transferred to a nitrocellulose filter. The filter was hybridized to 
32 P-labeled clone 1042 DDXl cDNA, stripped, and reprobed with la- 
beled a-fetoprotein cDNA. Markers (in kilobase pairs) are indicated on 
the side. 

and the lower band is a specific truncated or degradation prod- 
uct of DDXl. Yet a third possibility is that the two bands 
represent the products of differentially spliced transcripts or 
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Fig. 8. Distribution of DDXl in the nucleus and cytoplasm. A, 
cytosolic and nuclear extracts were prepared from RB522A and electro- 
phoresed in a 7% SDS-PAGE gel. "Cytosolic extracts were loaded in 
lanes 1 (20 /ig of protein) and 2 (10 jig), while nuclear extracts were 
loaded in lanes 3 (10 ptg) and 4 (20 jLg). DDXl was visualized using a 
1:5000 dilution of the antiserum to the N terminus. The BenchMark 
protein ladder size markers (kilodaltons) are indicated on the left. B, 
cytosolic and nuclear extracts were prepared from HL60, Y79, IMR-32, 
HeLa, RB522A, and RB(E)-2 and electrophoresed in an 8% SDS-PAGE 
gel. Twenty ptg of proteins were loaded in each lane marked C (cytosolic) 
and N (nuclear). DDXl was visualized using a 1:5000 dilution of the 
antiserum to the N terminus. Actin levels were analyzed using a 1:200 
dilution of anti- actin antibody (Santa Cruz Biotechnology). 

different translation initiation sites. However, the lack of any 
obvious differences in DDXl transcript sizes in the three RB 
and 10 NB lines analyzed in Fig. 7 A does not support the latter 
possibility (e.g. compare the DDXl transcript size in NUB-7 
(which produces the lower DDXl protein band) and in NB-1 
(which produces the higher DDXl protein band)). 

Subcellular Localization of DDXl Protein — DEAD box pro- 
teins have been implicated in a variety of cellular functions 
including RNA splicing in the nucleus, translation initiation in 
the cytoplasm, and ribosome assembly in the nucleolus. To 
obtain an indication of the possible role of DDXl, we studied its 
subcellular location. Nuclear and cytosolic extracts were pre- 
pared from DDXl -amplified RB522A and run on a 7% SDS- 
PAGE gel Although there was more DDXl protein in the 
cytosol than in the nucleus on a per cell basis, the proportion of 
DDXl protein relative to total protein was similar in both 
cellular compartments (Fig. 8A). Both the 90- and 84-kDa 
bands were present in cytosol and nuclear extracts, although 
the bottom band was more readily apparent in the cytosol. By 
running the gel for an extended period of time (twice as long as 
usual), we were able to detect an additional weak band at —88 
kDa in both nuclear and cytosolic extracts. 

To deterinine whether DDXl consistently localizes to both 
the cytoplasm and nucleus, we prepared cytosol and nuclear 
extracts from two additional DDXl -amplified lines, Y79 and 
IMR-32, as well as from nonamplified RB(E)-2, HL60, and 
HeLa. DDXl protein was found in both the nucleus and cyto- 
plasm of IMR-32, primarily in the cytoplasm of Y79, and 
mainly in the nucleus of the three nonamplified lines (Fig, SB). 
In addition, DDXl was almost exclusively found in nuclear 



DDXl Overexpression in Retinoblastoma and Neuroblastoma 



21167 



A 

1 2 3 4 5 6 



DDX1 



B 

MYCN 




FiG^IL SoibceUular location of DDXl protein. R£522A cells were 
fractionated into nuclear {lane 1), S100 and S4 cytosol {lanes 2 and 3\ 
P2 membrane {lane 4), P3 membrane (tone 5), and P4 membrane (/ane 
6) fractions. Twenty pt,g of protein were loaded in each lane and run on 
a 10% SDS-PAGE gel. A, DDXl protein was detected using a 1:5000 
dilution of the antiserum to the N terminus of DDXl. B, MYCN protein 
was detected using a commercially available antibody at a 1:200 dilu- 
tion. Size markers Qrilodaltons) are indicated on the side. 

extracts prepared from normal GM38 fibroblasts (data not 
shown). We used anti-actin antibody to ensure that our nuclear 
and cytosolic extracts were not cross-contaminated (Fig. 8£). 

We next carried out a more detailed analysis of DDXl sub- 
cellular location using two different approaches: (i) fraction* 
ation of cellular components into nuclei; SI 00 or S4 cytosol 
(containing soluble cytoplasmic components, including 40 S 
ribosomes); P2 (heavy mitochondria, plasma membrane frag- 
ments plus material trapped by these membranes); P3 (mito- 
chondria, lysosomes, peroxisomes, Golgi membranes, some 
rough endoplasmic reticulum); and P4 (microsomes from 
smooth and rough endoplasmic reticulum, Golgi and plasma 
membranes) (43); and (ii) immunogold electron microscopy. 
The Z^DXi -amplified RB522A cell line was used for both exper- 
iments. The fractionation procedures indicate that DDXl is 
mainly in the nucleus and in the cytosol (S4 and S100 fractions) 
of RB522A cells (Fig. 9A). As a control, we used anti-human 
MYCN antibody to determine the location of MYCN (also am- 
plified in RB522A) in our subcellular fractions. As shown in 
Fig. 95, MYCN was primarily found in the nucleus, as one 
would expect of a transcription factor. 

For the electron microscopy analysis, antiserum to the N 
terminus of DDXl was coupled to protein A gold particles, and 
the distribution of DDXl was examined in RB522A cells fixed 
in paraformaldehyde and glutaraldehyde. DDXl was present 
in both the cytoplasm and nucleus (data not shown). There was 
no association with either cell organelles or with nuclear or 
plasma membranes. 

DISCUSSION 

There are presently few clues as to the function of DDXl in 
normal and cancer cells. Our earlier data indicate that DDXl 
mRNA is present at higher levels in fetal tissues of neural 
origin (retina and brain) compared with other fetal tissues (21). 



There may therefore be a requirement for elevated levels of this 
putative RNA helicase for the efficient production or processing 
of neural specific transcripts. A role in cancer formation or 
progression is an intriguing possibility, because overexpression 
of an RNA unwinding protein could affect the secondary struc- 
ture of RNAs in such a way as to alter the expression of specific 
proteins in tumor cells. DDXl is co-amplified with MYCN in a 
subset of RB and NB cell lines and tumors (37-39). MYCN 
amplification is common in stage IV NB tumors and is a well 
documented indicator of poor prognosis. A general trend to- 
ward a poorer clinical prognosis is observed when both the 
MYCN and DDXl genes are amplified compared with when 
only MYCN is amplified (38, 39), suggesting a possible role for 
DDXl in NB tumor formation or progression. 

It is generally accepted that co-amplified genes are not over- 
expressed unless they provide a selective growth advantage to 
the cell (48, 49). For example, although ERBA is closely linked 
to ERBB2 in breast cancer and both genes are commonly am- 
plified in these tumors, ERBA is not overexpressed (48). Sim- 
ilarly, three genes mapping to 12ql3-14 (CDK4 f SAS, and 
MDM2) are overexpressed in a high percentage of malignant 
gliomas showing amplification of this chromosomal region, 
while other genes mapping to this region (GADD153, GLI, and 
A2MR) are rarely overexpressed in gene-amplified malignant 
gliomas (50, 51). The first three genes are probably the main 
targets of the amplification process, while the latter three 
genes are probably incidentally included in the amplicons. The 
data shown here indicate that DDXl is overexpressed at both 
the protein and RNA levels in £D^2-amplified RB and NB cell 
lines and that there is a strong correlation between DDXl gene 
copy number, DDXl RNA levels, and DDXl protein levels in 
these lines. Our results are therefore consistent with DDXl 
overexpression playing a positive role in some aspect of NB and 
RB tumor formation or progression. Recently, Weiss et aL (52) 
have shown that transgenic mice that overexpress MYCN de- 
velop NB tumors several months after birth. They conclude 
that MYCN overexpression can contribute to the initiation of 
tumorigenesis but that additional events are required for tu- 
mor formation. Amplification of DDXl may represent one of 
many alternative pathways by which a normal precursor "neu- 
roblast" or "retinoblast" cell gains malignant properties. 

The function of the majority of tissue-specific or development 
tally regulated DEAD box genes remains unknown. However, 
some members of this protein family have been either directly 
or indirectly implicated in tumorigenesis. For example, the p68 
gene has been found to be mutated in the ultraviolet light- 
induced murine tumor 8101 (53), while DDX6 (also known as 
RCK or p54) is encoded by a gene located at the breakpoint of 
the translocation involving chromosomes 11 and 14 in a cell 
line derived from a B-cell lymphoma (54, 55). Similarly, the 
production of a chimeric protein between DDX10 and the 
nucleoporin gene NUP98 has been proposed to be involved in 
the pathogenesis of a subset of myeloid malignancies with 
inv(ll) (pl5q22) (56). Interestingly, Grandori et al. (57) have 
shown that MYCC interacts with a DEAD box gene called 
MrDb, suggesting that the transcription of some DEAD box 
genes could be regulated through interaction with members of 
the MYC family. Future work will involve deterniining whether 
DDXl represents another member of the DEAD box family 
with a role in the tumorigenic process. 

DEAD box proteins have been implicated in translation ini- 
tiation, RNA splicing, RNA degradation, and RNA stability (3, 
18, 19). We carried out subcellular localization studies in an 
attempt to obtain a general indication of the function of DDXl. 
We found DDXl protein in both the cytoplasm and nucleus of 
DDZI- amplified NB and RB lines. In contrast, DDXl was 
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mainly located in the nucleus of nonamplified cell lines and 
normal fibroblast cultures. DDX1 was not associated with cel- 
lular organelles or with membranes based on immunoelectron 
microscopy. We therefore propose that the primary role of 
DDX1 is in the nucleus. The presence of DDXl in the cytoplasm 
of DDX1- amplified cells may indicate that the amount of DDXl 
protein that is allowed in the nucleus is tightly regulated. 
Alternatively, DDXl may play a dual role in the nucleus and 
cytoplasm of DDXl -amplified cells. 

An important component of our analysis was to identify the 
translation and transcription initiation sites of DDXl, We used 
a combination of techniques to identify the transcription start 
site: screening of RB and fetal brain libraries, RACE, primer 
extension, genomic DNA sequencing, SI nuclease mapping, 
and Northern blot analysis using probes to the predicted 5 '-end 
of the transcript. The transcription start site identified using 
these techniques is located —300 nt upstream of the predicted 
translation initiation codon and was readily detected in three 
DDXl- amplified lines and barely detectable in a fourth ampli- 
fied line. The 5 '-untranslated region as well as the first in 
frame methionine are encoded within the first exon of DDXl. 
An in frame stop codon is located 123 nt upstream of the 
predicted initiation codon. We were unable to identify the tran- 
scription initiation site of DDXl in two of the six amplified lines 
tested as well as in nonamplified lines. Although it remains 
possible that there are different transcription start sites in 
different cell lines, -detection of lower- levels (rather than the 
absence) of the 5 '-most 160 nt of the DDXl transcript in IMR- 
32, Y79, and LA-N-5 compared with RB522A, BE(2)-C, and 
LA-N-1 supports a quantitative rather than a qualitative dif- 
ference in the 5 '-end of this transcript in these cells. Our 
results suggest that the 5' -end of DDXl mRNA is rarely intact, 
even in mRNA preparations that otherwise appear to be of high 
quality based on analysis of control transcripts. The 5' -end of 
DDXl mRNA may therefore be especially susceptible to degra- 
dation, perhaps because of its sequence and/or secondary 
structure. 

In conclusion, we have mapped the 5 '-end of the 2.7-kb DDXl 
transcript and have identified the predicted translation initia- 
tion site of DDXl protein. We have found that DDJO -amplified 
RB and NB tumor lines overexpress DDXl protein and that 
there is a good correlation between gene copy number and both 
transcript and protein levels in these cells. We have shown that 
DDXl protein is primarily located in the nucleus of cells that 
are not DDXl -amplified. In contrast, DDXl is present in both 
the nucleus and cytoplasm of DDXl- amplified NB and RB 
lines. A cytoplasmic location in DDXl -amplified lines may in- 
dicate that the amount of nuclear DDXl is tightly regulated or 
that DDXl plays a dual role in the cytoplasm and nucleus of 
these cells. 
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Aneupioidy and cancer 

Subrata Sen, PhD 



Numeric aberrations In chromosomes, referred to as aneu- 
pioidy, ts commonly observed in human cancer, Whether aneu- 
pioidy is a cause or consequence of cancer has long been 
debated. Three lines of evidence now make a compelling case 
for aneupioidy being a discrete chromosome mutation event 
that contributes to malignant transformation and progression 
process. First, precise assay of chromosome aneupioidy in 
several primary tumors with in situ hybridization and compara- 
tive genomic hybridization techniques have revealed that 
specific chromosome aneusomies correlate with distinct tumor 
phenotypes. Second, aneuptoid tumor cell lines and in vitro 
transformed rodent ceils have been reported to display an 
elevated rate of chromosome instability, thereby indicating that 
aneupioidy is a dynamic chromosome mutation event associ- 
ated with transformation of cells. Third, and most important, a 
number of mitotic genes regulating chromosome segregation 
have been found mutated in human cancer ceHs, implicating 
such mutations in induction of aneupioidy in tumors. Some of 
these gene mutations, possibly allowing unequal segregations 
of chromosomes, also cause tumorigenic transformation of 
ceils in vitro. In this review, the recent publications investigate 
ing aneupioidy in human cancers, rate of chromosome instabil- 
ity in aneupioidy tumor cells, and genes implicated in regulat- 
ing chromosome segregation found mutated in cancer celts 

are discussed. Curt Opfn Onccrf 2000, 1255-88 C 2000 Uppincoll Williams 
4 Wittdns, Inc. 
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Cancer research over rhc past decade has firmly estab- 
lished that malignant cells accumulate a large number of 
genetic mutations that affect differentiation, prolifera- 
tion, and cell death processes, In addition, it is also 
recognized that most cancers are clonal, although they 
display extensive heterogeneity with respect to kary- 
otypes and phenotypes of individual clonal populations. 
It is estimated that numeric chromosomal imbalance, 
referred to as aneupioidy^ is the most prevalent genetic 
change recorded among over 20,000 solid tumors 
analyzed thus far flj. Phenotypic diversity of the clonal 
populations in individual tumors involve differences in 
morphology, proliferative properties, antigen expression, 
drug sensitivity, and metastatic potentials. It has been 
proposed that an underlying acquired genetic instability 
is responsible for the multiple mutations detected in 
= cancer cells that lead to tumor heterogeneity and 
progression f 2]. In a somewhat contradictory argument, 
it has also been suggested that clonal expansion due to 
selection of cells undergoing normal rates of mutation 
can. explain malignant transformation and progression 
process in humans [3]. Acquired genetic instability, 
nonetheless, is considered important for more rapid 
progression of the disease I4»»J* Although the original 
hypothesis on genetic instability in cancer primarily 
focused on chromosome imbalances in the form of aneu- 
pioidy in tumor cells, the actual relevance of such muta- 
tions in cancer remains a controversial issue. 

Whether or not aneupioidy contributes to the malignant 
transformation and progression process has long been 
debated. A prevalent idea on genetics of cancer referred 
to as "somatic gene mutation hypothesis" contends that 
gene mutations at the nucleotide level alone can cause 
cancer by either activating cellular proto-oncogencs to 
dominant cancer causing oncogenes and/or by inactivat- 
ing growth inhibitory tumor suppressor genes. In this 
scheme of things chromosomal instability in the form of 
aneupioidy is a mere consequence rather than a cause of 
malignant transformation and progression process. 

In this review, some of the recent observations on the 
subject arc discussed and compelling evidence is 
provided to suggest that aneupioidy is a distinct form of 
genetic instability in cancer that frequently correlates 
with specific phenotypes and stages of the disease, 
Furthermore, discrete genetic targets affecting chromo- 
somal stability in cancer cells, recently identified, arc 
also discussed* These data provide a new direction 
toward elucidating the molecular mechanisms rcsponsi- 
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ble for induction of ancuploidy in cancer and may even- 
tually be exploited as novel therapeutic targets in the 
future. 

Genetic alterations in cancer 

Alterations in many genetic loci regulating growth, 
senescence, and apoptosis, identified in tumor cells, 
have led to the current understanding of cancer as a 
genetic disease. The genetic changes identified in 
tumors include: subtle mutations in genes at the 
nucleotide level; chromosomal translocations leading to 
structural rearrangements in genes; and numeric 
changes in either partial segments of chromosomes or 
whole chromosomes (ancuploidy) causing imbalance in 
gene dosage. 

For the purpose of this review, both segmental and whole 
chromosome imbalances leading to altered DNA dosage 
in cancer cells arc included as examples of ancuploidy. 

Incidence of aneuploidy in cancer 

Evidence of ancuploidy involving one or more chromo- 
somes have been commonly reported in human tumors. 
Although these observations were initially made using 
classic cytogenetic techniques late in a tumor's evolu- 
tion and were difficult to correlate with cancer progres- 
sion, more recent studies have reported association of 
specific nonrandom chromosome ancuploidy with 
different biologic properties such as loss of hormone 
dependence and metastatic potential [5], 

Classic cytogenetic studies performed on tumor cells 
had serious limitations in scope because they were 
applicable only to those cases in which mitotic chromo- 
somes could be obtained. Because of low spontaneous 
rates of cell division in primary tumors, analyses 
depended on cells either derived selectively from 
advanced metastases or those grown in vitro for variable 
periods of time. In both instances, metaphascs analyzed 
represented only a subset of primary tumor ceil popula- 
tion. Two major advances in cytogenetic analytic tech- 
niques, in situ hybridization (rSH) and comparative 
genomic hybridization (CGH), have allowed better reso- 
lution of chromosomal aberrations in freshly isolated 
tumor cells [6]. ISH analyses with chromosome-specific 
DNA probes, a powerful adjunct to metaphasic analysis, 
allows assessment of chromosomal anomalies within 
tumor cell populations in the contexts of whole nuclear 
architecture and tissue organization. CGH allows 
genome wide screening of chromosomal anomalies 
without the use of specific probes even in the absence 
of prior knowledge of chromosomes involved. Although 
both techniques have certain limitations in terms of 
their resolution power, they nonetheless provide a 
better approximation of chromosomal changes occurring 
among tumors of various histology, grade, and stage 



compared with what was possible with the classic Cyto- 
genetic techniques. Genomic ploidy measurements 
have also been performed at the DNA level with flow 
cytometry and cytofluoromctric methods. Although 
these assays underestimate chromosome ploidy due to a 
chromosomal gain occasionally masking a chromosomal 
loss in the same cell, several studies using these 
methods have supported the conclusion that DNA 
aneuploidy closely associates with poor prognosis in 
various cancers [7,81. This discussion of some recent 
examples published on aneuploidy in cancer includes 
discussion of studies dealing with DNA ploidy measure- 
ments as well Most of these observations arc correlative 
without direct proof of specific involvement of genes on 
the respective chromosomes. Identification of putative 
oncogenes and tumor suppressor genes on gained and 
lost chromosomes in aneupioid tumors,, however, arc 
providing strong evidence that chromosomes involved in 
aneuploidy play a critical role in the tumorigenic 
process. 

In renal tumors, cither segmental or whole chromosome 
aneuploidy appears to be uniquely associated with 
specific histologic subtypes [9]. Tumors from patients 
with hereditary papillary renal carcinomas (HPRC) 
commonly show trisomy of chromosome 7, when 
analyzed by CGH. Germlinc mutations of a putative 
oncogene MFS have been detected in patients with 
HPRC. A recent study [ 10] has demonstrated that an 
extra copy of chromosome 7 results in nonrandom dupli- 
cation of the mutant MKT allele in HPRC, thereby 
implicating this trisomy in tumorigencsis. The study 
suggested that mutation of MET may render the cells 
more susceptible to errors in chromosome replication, 
and that clonal expansion of cells harboring duplicated 
chromosome 7 reflects their proliferative advantage. In 
addition to chromosome 7, trisomy of chromosome 17 in 
papillary tumors and also of chromosome 8 in mesoblas- 
tic nephroma arc commonly seen. Association of specific 
chromosome imbalances with benign and malignant 
forms of papillary renal tumors, therefore, not only 
contribute to an understanding of tumor origins and 
evolution, but also implicate aneuploidy of the respec- 
tive chromosomes in the tumorigenic transformation 
process. 

In colorectal tumors, chromosome aneuploidy is a 
common occurrence. In fact, molecular allclotyping 
studies have suggested that limited karyotyping data 
available from these rumors actually underestimate the 
true extent of these changes. Losses of heterozygosity 
reflecting loss of the maternal or paternal allele in 
tumors are widespread and often accompanied by a gain 
of the opposite allele* Therefore, for example, a tumor 
could lose a maternal chromosome while duplicating 
the same paternal chromosome, leaving the tumor cell 
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with a normal karyotype and ploidy but an aberrant 
allctotypc. It has been estimated that cancer of the 
colon, breast, pancreas, or prostate may lose an average 
of 25% of its alleles. It is not unusual to discover that a 
tumor has lost over half of its alleles |4|. In clinical 
settings, DNA ploidy measurements have revealed that 
DNA ancuploidy indicates high risk of developing 
severe prcmalignant changes in patients with ulcerative 
colitis, who arc known to have an increased risk of 
developing colorectal cancer [til. DNA ancuploidy has 
been found to be one of the useful indicators of lymph 
node metastasis in patients with gastric carcinoma and 
associated with poor outcome compared with diploid 
cases [12,131. CGIf analyses of chromosome ancu- 
ploidy, on the other hand, was reported to correlate gain 
of chromosome 20q with high tumor S phase fractions 
and loss of 4q with low tumor u pop to tie indices {14]. 
Ancuploidy of chromosome 4 in metastatic colorectal 
cancer has recently been confirmed in studies that used 
unbiased DNA fingerprinting with arbitrarily primed 
polymerase chain reactions to detect moderate gains 
and losses of specific chromosomal DNA sequences 
[15]. The molecular karyotype (amplotypc) generated 
from colorectal cancer revealed that moderate gains of 
sequences from chromosomes 8 and 13 occurred in 
most tumors, suggesting that overreprcscntation of 
these chromosomal regions is a critical step for metasta- 
tic colorectal cancer. 

In addition to being implicated in tumorigenesis and 
correlated with distinct tumor phenotypes, chromosome 
ancuploidy has been used as a marker of risk assessment 
and prognosis in several other cancers. The potential 
value of aneuploidy as a noninvasive tool to identify 
individuals at high risk of developing head and neck 
cancer appears especially promising. Interphase fluores- 
cence /// stilt hybridization (FI*SH) revealed extensive 
ancuploidy in tumors from patients with head and neck 
squamous cell carcinomas (HNSCC) and also in clini- 
cally normal distant oral regions from the same individu- 
als [16,17]. It has been proposed that a panel of chromo- 
some probes in FISH analyses may serve as an 
important tool to detect subclinical tumorigenesis and 
for diagnosis of residual disease. The presence of aneu- 
ploid or tctraploid populations is seen in 90% to 95% of 
esophageal adenocarcinomas, and when seen in 
conjunction with Barrett's esophagus, a prcmalignant 
condition, predicts progression of disease [18,19], 
Chromosome ploidy analyses in conjunction with loss of 
heterozygosity and gene mutation studies in Barrett's 
esophagus reflect evolution of neoplastic cell lineages in 
vivo |201. Evolution of neoplastic progeny from Barrett's 
esophagus following somatic genetic mutations 
frequently involves bifurcations and loss of heterozygos- 
ity at several chromosomal loci leading to ancuploidy 
and cancer. Accordingly, it is hypothesized that during 



tumor cell evolution diploid cell progenitors with 
somatic genetic abnormalities undergo expansion with 
acquired genetic instability. Such instability, often 
manifested in the form of increased incidence of ancu- 
ploidy, enters a phase of clonal evolution beginning in 
prcmalignant cells that proceeds oyer a period of time 
and occasionally leads to malignant transformation. The 
clonal evolution continues even after the emergence of 
cancer. 

The significance of DNA and chromosome ancu- 
ploidy in other human cancers continue to be evalu- 
ated. Among papillary thyroid carcinomas, ancuploid 
DNA content in tumor cells was reported to correlate 
with distant metastases, reflecting worsened progno- 
sis (21 1. Genome wide screening of follicular thyroid 
tumors by CGH, on the other hand, revealed, frequent 
loss of chromosome 22 in widely invasive follicular 
carcinomas [22]. Chromosome copy number gains in 
invasive neoplasm compared with foci of ductal carci- 
noma in situ (DCIS) with similar histology have been 
proposed to indicate involvement of ancuploidy in 
progression of human breast cancer [23]. LSH analyses 
of cervical intraepithelial neoplasia has provided 
suggestive evidence chat chromosomes 1, 7 and X 
ancusomy is associated with progression coward cervi- 
cal carcinoma (24]. 

Although the prognostic value of numeric aberrations 
remains a matter of debate in human hematopoietic 
neoplasia, there have been recent studies to suggest that 
the presence of monosomy 7 defines a distinct subgroup 
of acute myeloid leukemia patients [25]. It is interesting 
in this context that therapy-related myelodysplasia* 
syndromes have been reported to display monosomy 5 
and 7 karyotypes* reflecting poor prognosis [26}. 

The clinical observations, mentioned previously, arc 
supported by in vitro studies in human and rodent cells in 
which ancuploidy is induced at early stages of transforma- 
tion [27,28]. It is even suggested that ancuploidy may 
cause eel! immortalization, in some instances, that is a 
critical step prccccding transformation. 

Finally, in an interesting study to develop transgenic 
mouse models of human chromosomal diseases, chromo- 
some segment specific duplication and deletions of the 
genome were reported to be constructed in mouse 
embryonic stem cells (29]. Three duplications for a 
portion of mouse chromosome 1 1 syntcnic with human 
chromosome 17 were established in the mouse 
germline. Mice with 1Mb duplication developed corneal 
hyperplasia and thymic tumors. The findings reprcsenr 
the first transgenic mouse model of ancuploidy of a 
defined chromosome segment that documents the direct 
role of chromosome ancusomy in tumorigenesis. 
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Aneuploidy as dynamic cancer-causing 
mutation" instead of a "consequential state" 
in cancer 

According to the hypothesis previously discussed, aneu- 
ploidy represents either a "gain of function" or "loss of 
function' 1 , mutation at the chromosome Jevcl with a 
causative influence on the tumorigenesis process. The 
hypothesis, however, is based only on circumstantial 
evidence even though existence of ancuploidy is corre- 
lated with different tumor phenotypes. The existence of 
numeric chromosomal alterations in a tumor docs not 
mean that the change arose as a dynamic mutation due 
to genomic instability, because several factors could lead 
to consequential aneuploidy in tumors, also. Although 
ancuploidy as a dynamic mutation due to genomic insta- 
bility in tumor cells would occur at a certain measurable 
rate per cell generation, a consequential state of ancu- 
ploidy in tumors may not occur at a predictable rate 
under similar conditions or in tumors with similar 
phenotypes. In addition to genomic instability, differ* 
ences in environmental factors with selective pressure, 
could explain high incidence of ancuploidy and other 
.somatic mutations in tumors compared with normal cells 
[4]. These. include humoral, cell substratum, and cell- 
cell interaction differences between tumor and normal 
cell environments. Ic could be argued that despite 
similar rates of spontaneous aneuploidy induction in 
normal and tumor cells, the latter are selected to prolif- 
erate due to altered selective pressure in the tumor cell 
environment, whereas the normal cells are eliminated 
through activation of apoptosts. Alternatively, of course, 
one could postulate that selective expression or overcx- 
prcssion of anti-apoptotic proteins or inactivation of 
proapoptotic proteins in tumor cells may counteract 
default induction of apoptosis in G2/M phase cells 
undergoing misscgrcgation of chromosomes. Recent 
demonstration of ovcrcxprcssion of a G2/M phase antt- 
apoprotic protein survivin in cancer cells {30] suggests 
that this protein may favor aberrant progression of aneu- 
ploid transformed cells through mitosis. This would 
then lead to proliferation of aneuploid cell lineages, 
which may undergo clonal evolution. 

To ascertain that ancuploidy is a dynamic mutational 
event, various human tumor cell lines and transformed 
rodent cell lines have been analyzed for the rate of 
ancuploidy induction. When grown under controlled in 
vitro conditions, such conditions ensure that environ* 
mental factors do not influence selective proliferation of 
ceils with chromosome instability. In one study, 
Lcngaucr et ai [31*1 provided unequivocal evidence by 
FISH analyses that losses or gains of multiple chromo- 
somes occurred in excess of 10** per chromosome per 
generation in aneuploid colorectal cancer cell lines. The 
study further concluded that such chromosomal instabil- 
ity appeared to be a dominant trait. Using another in 
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vitro model system of Chinese hamster embryo (CHE) 
cells, Duesbcrg a ai [32*1 have also obtained similar 
results. With clonal cultures of CHE cells, transformed 
with nongenotoxic chemicals and a mitotic inhibitor, 
these authors demonstrated that the overwhelming 
majority of the transformed colonies contained more 
than 50% aneuploid cells, indicating that aneuploidy 
would have originated from the same ceils that under- 
went transformation. All the transformed colonies tested 
were tumorigenic. It was further documented that the 
ploidy factor representing the quotient of the modal 
chromosome number divided by the normal diploid 
number, in each clone, correlated directly with the 
degree of chromosomal instability. Therefore, chromo- 
somal instability was found proportional to the degree of 
ancuploidy in the transformed cells and the authors 
hypothesized that ancuploidy is a unique mechanism of 
simultaneously altering and destabilizing, in a massive 
manner, the normal cellular phenotypes. In the absence 
of any evidence that the transforming chemicals used in 
the study did not induce other somatic mutations* ic is 
difficult to rule out the contribution of such mutations 
in the transformation process. These results nonetheless 
make a strong case for aneuploidy being a dynamic chro- 
mosome mutation event intimately associated with 
cancer. 

Aneuploidy versus somatic gene mutation in 
cancer 

The idea that numeric chromosome imbalance or aneu- 
ploidy is a direct cause of cancer was proposed at the 
turn of the century by Theodore Bovcri [331. However, 
the hypothesis was largely ignored over the last several 
decades in favor of the somatic gene mutation hypothe- 
sis, mentioned earlier. Evidence accumulating in the 
literature lately on specific chromosome aneusomies 
recognized in primary tumors, incidence of ancuploidy 
in cells undergoing transformation, and aneuploid tumor 
cells showing a high rate of chromosome instability have 
led to the rejuvenation of Bovcri's hypothesis. The 
concept has recently been discussed as a "vintage wine 
in a new bottle** [34»]. The author points out that 
except for rare cancers caused by dominant retroviral 
oncogenes, diploidy does not seem to occur in solid 
tumors, whereas aneuploidy is a rule rather than excep- 
tion in cancer. 

Ancuploidy as an effective mutagenic mechanism 
driving tumor progression, on the other hand, is being 
recognized as a viable solution to the paradox that with 
known mutation rate in non-gcrmlinc cells (~10~ 7 per 
gene per cell generation) tumor cell lineages cannot 
accumulate enough mutant genes during a human life- 
time [351. The concept is gaining significant credibility 
since genes that potentially affect chromosome segrega- 
tion were found mutated in human cancer. Some of 
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these genes have also been shown to have transforming 
capability in in vitro assays. Selected recent publications 
describing the findings are being discussed below in 
reference to the mitotic targets potentially involved in 
inducing chromosome segregation anomalies in cells. 

Potential mitotic targets and molecular 
mechanisms of aneuploidy 

Because aneuploidy represents numeric imbalance in 
chromosomes, it is reasonable to expect that aneuploidy 
arises due to misscgregarion of chromosomes during ceil 
division. There arc many potential mitotic targets, 
which could cause unequal segregation of chromosomes 
(Fig. 1). Recent investigations have identified several 
genes involved in regulating these mitotic targets and 
mitotic checkpoint functions, which can be implicated 
in induction of aneuploidy in tumor cells. This discus- 
sion is restricted to those mitotic targets and checkpoint 
genes whose abnormal functioning has been observed in 
cancer or has been shown to cause tumorigenic transfor- 
mation of cells, in recent years. The role of telomeres is 
discussed elsewhere in this issue. For a more detailed 
description of the components of mitotic machinery and 
their possible involvement in causing chromosome 
segregation abnormalities in tumor cells, readers may 
refer to a recently published review [36«], 

Among the mitotic targets implicated in cancer, ccntro- 
some defects have been observed in a wide variety of 
malignant human tumors. Centrosomcs play a central role 
in organizing the microtubule network in interphase cells 
and mitotic spindle during cell division* Multipolar 
mitotic spindles have been observed in human cancers in 
situ and abnormalities in the form of supernumerary 
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centrosomcs, centrosomcs of aberrant size and shape as 
well as aberrant phosphorylation of cencrosorne proteins 
have been reported in prostate, colon* brain, and breast 
tumors [37,38]. In view of the findings that abnormal 
centrosomcs retain the ability to nucleate microtubules in 
vitro, it is conceivable that cells with abnormal centro- 
somcs may missegregatc chromosomes producing ancu- 
ploid cells. The molecular and genetic bases of abnormal 
centrosomc generation and the precise pathway through 
which they regulate the chromosome segregation process 
remain to be elucidated. Recent discovery of a ccntro* 
some-associated kinase STKI5/BTAK/aurora2, naturally 
amplified and overexpresscd in human cancers, has raised 
the interesting possibility that aberrant expression of this 
kinase is critically involved in abnormal centrosomc func- 
tion and unequal chromosome segregation in tumor cells 
[39,40]. Exogenous expression of the kinase in rodent and 
human cells was found to correlate with an abnormal 
number of centrosomcs, unequal partitioning of chromo- 
somes during division, and tumorigenic transformation of 
cells. Tt is rclcvanrin this context to mention that the 
Xenopus homologuc of human STKl5/BTAK/aurnra2 
kinase has recently been shown to phosphorylate a micro- 
tubule motor protein XlKgS, the human orthologtic of 
which is known to participate in the centrosomc separa- 
tion during mitosis J411- Findings on STK15/aurora2 
kinase, thus, provide an interesting lead to a possible 
molecular mechanism of ccntrosomc's role in oncogene- 
sis. Centrosomcs have, of late, been implicated in onco- 
genesis from studies revealing supernumerary centro- 
somcs in ^5.?-deficicnt fibroblasts and overexprcssion of 
another centrosomc kinase PLK1 being detected in 
human non-small cell lung cancer [42], 

One of the critical events that ensures equal partition- 
ing of the chromosomes during mitosis is the proper 
and timely separation of sister chromatids that arc 
attached to each other and to the mitotic spindle. 
Untimely separation of sister chromatids has been 
suspected as a cause of aneuploidy in human tumors. 
Cohesion between sister chromatids is established 
during replication of chromosomes and is retained until 
the next mctaphasc/anaphase transition. It has been 
shown that during mctaphasc-anaphasc transition, the 
anaphase promoting complex/cyclosome triggers the 
degradation of a group of proteins called sccurins that 
inhibit sister chromatid separation. K vertebrate sccurin 
(v-sccurin) has recently been identified that inhibits 
sister chromatid separation and is involved in transfor- 
mation and tumorigencsis. Subsequent analysis 
revealed that the human sccurin is identical to the 
product of the gene called pituitary tumor transforming 
gene, which is overexpresscd in some tumors and 
exhibits transforming activity in NIH3T3 cells. Jr is 
proposed that elevated expression of the v-sccurin may 
contribute to generation of malignant tumors due to 
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chromosome gain or loss produced by errors in chro- 
matid separation [43*1. 

Normal progression through mitosis during prophase to 
anaphase transition is monitored at least at two check- 
points: One checkpoint operates during early prophase 
at GZ to meraphase progression while the second 
ensures proper segregation of chromosomes during 
mctuphasc to anaphase transition. Several mitotic 
checkpoint genes responding to mitotic spindle defects 
have been identified in yeast. The metaphase-anaphase 
transition is delayed following activation of this check* 
point during which kinetochorcs remain unattached to 
the spindle. The signal is transmitted through a kineto- 
chorc protein complex consisting of Mpslp and several 
Mad and Bub proteins [44]. It is expected that for 
unequal chromosome segregation to be perpetuated 
through cell proliferation cycles giving rise, to aneu- 
ploidy, checkpoint controls have to be abrogated. 

Following this logic, Vogelstcin ttaL [4S»] hypothesized 
that ancuploid tumors would reveal mutation in mitotic 
spindle checkpoint genes. Subsequent studies by these 
investigators have proven the validity of this hypothesis 
and a small fraction of human colorectal cancers have 
revealed the presence of munitions in cither hBubl or 
hBubR! checkpoint genes. It was further revealed that 
mutant BUBl could function in a dominant negative 
manner conferring an abnormal spindle checkpoint 
when expressed exogenously. Inactivatton of spindle 
checkpoint function in viraliy induced leukemia has also 
recently been documented following the finding that 
hMADl checkpoint protein is targeted by the Tax 
protein of the human T-ccll leukemia virus type !, 
Abrogation of hMADl function leads to multtnuclcation 
and aneuploidy [46]. 

In addition to mitotic spindle checkpoint defects, failed 
DNA damage checkpoint function in yeast is frequently 
associated with aberrant chromosome segregation as 
well. It, thercford appears intriguing yet relevant that 
the human BRCAl gene, proposed to be involved in 
DNA damage checkpoint function, when mutated by a 
targeted deletion of exon 1 1 led to defective G2/M cell 
cycle checkpoint function and genetic instability in 
mouse embryonic fibroblasts [47]. The cells revealed 
multiple functional centrosomes and unequal chromo- 
some segregation and aneuploidy. Although the molecu- 
lar basis for these abnormalities is not known at this 
time, it raises the interesting possibility that such an 
ancuploidy-drivcn mechanism may be involved in 
tumorigenesis in individuals carrying gcrmline muta- 
tions of BRCA / gene. 
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Conclusion 

Growing evidence from human tumor cytogenetic inves- 
tigations strongly suggest that aneuploidy is associated 
with the development of tumor phenotypes. Clinical 
findings of correlation between aneuploidy and tumori- 
genesis arc supported by studies with in vitro grown 
transformed cell lines* Molecular genetic analyses of 
tumor cells provide credible evidence that mutations in 
genes controlling chromosome segregation during 
mitosis play a critical role in causing chromosome insta- 
bility leading to aneuploidy in cancer. Further elucida- 
tion of molecular and physiologic bases of chromosome 
instability and aneuploidy induction could lead to the 
development of new therapeutic approaches for 
common forms of cancer. 
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Amplification and overexpression of putative oncogenes 
confer growth advantages for tumor development We 
used a functional genomic approach that integrated 
simultaneous genomic and transcript microarray, proteo- 
mics, and tissue microarray analyses to directly identify 
putative oncogenes in lung adenocarcinoma. We first 
identified 183 genes with increases in both genomic copy 
number and transcript in six lung adenocarcinoma cell 
lines.- Next, we used two-dimensional polyacryJamide gel 
electrophoresis and mass spectrometry to identify 42 
proteins that were overexpressed in the cancer cells 
relative to normal cells. Comparing the 183 genes with 
the 42 proteins, we identified four genes - PRDXU 
EEF1A2, CALR, and KCIP-1 - in which elevated protein 
expression correlated with both increased DNA copy 
number and increased transcript levels (all r>0.84, two- 
sided F<0.05). These findings were validated by South- 
ern, Northern, and Western blotting. Specific inhibition of 
EEF1A2 and KCIP-1 expression with siRNA in the four 
cell lines tested suppressed proliferation and induced 
apoptosis. Parallel fluorescence in situ hybridization and 
immunobistochemical analyses of EEF1A2 and KCIP-1 in 
tissue microarrays from patients with lung adenocarcinoma 
showed that gene amplification was associated with high 
protein expression for both genes and that protein 
overexpression was related to tumor grade, disease stage, 
Ki-67 expression, and a shorter survival of patients. The 
amplification of EEF1A2 and KCIP~1 and the presence of 
overexpressed protein in tumor samples strongly suggest 
that these genes could be oncogenes and hence potential 
targets for diagnosis and therapy in lung adenocarcinoma, 
Oncogene (200Q 25, 2628-2635. doi: 10. 1038/sj.onc. 1 209289; 
published online 12 December 2005 

Keywords: lung cancer; microarrays; proteomics; tissue 
microarray 



Correspondence: Ass i slant Professor P Jiang, Department of Patho- 
logy, The University of Maryland School of Medicine, 10 South Pine 
Street, MSTF 7lh floor. BalUmorc, MD 21201-1192, USA. 
E-mail: f3iang@50ro.umaryiand.edu 

Received 27 July 2005; revised 19 October 2005; accepted 27 October 
2005; published online 12 December 2005 



Introduction 

In lung adenocarcinoma, as in other types of cancer, 
gene amplification and the consequent overexpression of 
the amplified oncogene play an important role in the 
development of tumors, because their overexpression 
confers a growth advantage. The ability to identify 
putative oncogenes that are activated during turaorigen- 
esis could facilitate the choice of molecular genetic 
targets for diagnosis and therapy of the disease. This 
concept has been exemplified by HER-2, which was first 
found to be amplified in neuroblastomas and subse- 
quently shown to be associated with poor prognosis in 
breast cancer (Ross and Fletcher, 1999). Now, HER-2 
aberrations are used as a predictor of response to 
therapy, and treatment of HER-2-positive breast cancer 
with the monoclonal anti-HER-2 antibody trastuzumab 
has been shown to improve prognosis (Ross and 
Fletcher, 1999). Emerging evidence of common ampli- 
cons in lung adenocarcinomas (Luk et aL, 2001; Jiang 
et aL, '2004; Tonon et a/., 2005) suggests that additional 
oncogenes remain to be identified; however, conven- 
tional techniques are ineffective in pinpointing such 
oncogenes. Parallel measurement of DNA copy number 
and mRNA levels in cDNA microarrays permits 
changes in copy number to be compared with transcrip- 
tion levels on a gene-by-gene basis to generate lists of 
candidate genes within the defining amplicons (Hyman 
et a/., 2002; Pollack et a/ M 2002). However, use of 
transcript patterns does not allow assessment of the 
expression of protein products or identification of proto- 
oncogenes. Another approach, identifying differentially 
expressed proteins by proteomic analysis and then 
comparing the proteins present with mRNA expression 
in cDNA microarrays from the same specimens, can 
clarify the extent to which changes in transcript patterns 
reflect changes in their cognate proteins and post- 
transcriptional mechanisms (Chen et a/., 2002), but this 
approach cannot be used to identify oncogenes driven 
by extensive increases of their gene copy number. 
Moreover, using individual microarrays or proteomic 
approaches alone cannot distinguish the cancer-driving 
oncogenes that directly propel tumor progression from 
the larger number of passenger genes that may be 
concurrently over-represented but are not biologically 
relevant in tumor development. 
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In this study, we used a comprehensive approach that 
integrated simultaneous comparative genomic hybridi- 
zation (CGH) and transcript microarray with proteomic 
analyses of six lung adenocarcinoma cell lines. We 
directly and specifically identified four putative onco- 
genes that could have been activated through amplifica- 
tion and consequent elevation of transcript expression. 
We used small interfering RNA (siRNA) to inhibit the 
expression of two of these four genes in the lung cancer 
cell tines, which further implicated them in oncogenesis. 
We then explored the clinical significance of these 
findings by assessing the expression of these two genes 
in tissue microarrays of human lung cancer specimens. 
Our findings underscore the power of integrated 
functional genomic analyses for identifying putative 
oncogenes in tumorigenesis; such activated genes could 
be useful as targets for diagnosis or therapy in lung 
cancer. 



Results 

Simultaneous global genomic and transcript analyses 
identify 183 genes with increases in genomic copy 
numbers and transcript expression levels 
To identify genes in which increased DNA copy number 
might contribute to increased transcript in lung adeno- 
carcinomas, first we used CGH with microarrays of six 
lung adenocarcinoma cell lines. We identified 587 genes 
showing increases in DNA copy number across all six 
cell lines (Supplementary Table IS), which were 
distributed as 90 amplicons on all chromosomes except 
for chromosomes 13 and Y (Supplementary Table 2S). 
A subsequent transcript test with the identical arrays of 
the same cell lines revealed 275 genes that showed 
increased mRNA levels (Supplementary Table 3S), 
Using random permutation tests across all cancer cell 
lines, we identified 183 genes (31%) that showed 
elevated transcript levels from the 587 genes that were 
over-represented in the genome (Table 1), suggesting 
that elevated transcript levels of the 183 genes may 
reflect their genomic over-representation in the cancer 
cells. These findings are consistent with previous reports 
Unking genomic changes with altered transcript patterns 
in breast cancer (Hyman et al. % 2002; Pollack et a/., 
2002). However, our finding that only 31% of the genes 
showing increased DNA copy numbers had cognate 
increases in transcript expression in lung adenocarcino- 
mas is different from the overall rates of 40-60% 
reported for breast cancer (Hyman et al. t 2002; Pollack 
et al. 9 2002). This discordance may reflect methodologic 
differences between studies or biological differences 
between breast cancer and lung adenocarcinoma. 

Proteomic analyses identify four genes for which protein 
abundance was associated with increases in the cognate 
gene and transcript levels 

Analysis of transcript patterns is insufficient for under- 
standing the expression of protein products and the 
effect of genomic over-representation on the expression 
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of their cognate proteins. To extend these findings 
beyond genomic over-representation to expression of 
the protein products of those genes, we next assessed 
protein expression in the same cell lines by two- 
dimensional polyacrylamide gel electrophoresis (PAGE) 
and found that 42 different proteins, representing 42 
individual genes, were significantly increased in the 
cancer cell lines (Table 2; Supplementary Figures IS and 
2S). Some of these proteins were identified as having 
multiple isoforms, and all individual isoforms exhibited 
increases in expression ranging from 4.6 to 12.8 times 
their expression in normal lung tissue cells. In compar- 
ing protein level of the 42 genes with changes in their 
cognate genomic and mRNA expression from the global 
microarray analyses, we found that four (9.5%) of those 
42 genes - PRDX1 % EEF1A2, CALR, and KCIP-1 - 
showed statistically significant correlations between 
elevated protein expression and increases in both copy 
number and mRNA expression (all r>0.84; P<0.05) 
(Table 2) in the cancer ceil lines. These findings imply 
that the abundance of these four proteins is attributable 
to the amplification and consequent elevated transcrip- 
tion of their cognate genes. 



Validation of copy number, transcript, and protein 
expression ofPRDXl. EEFJA2, CALR. and KCIP-1 
in lung cancer cell lines 

To confirm our findings from the high-throughput 
analyses, we next used Southern, Northern, and Western 
blotting to assess DNA, RNA, and protein levels for the 
four genes identified in the six cell tines. For compar- 
ison, we arbitrarily chose one gene, NFKBI, in which an 
increase in protein level did not correlate with genetic 
changes. Overall, we found excellent concordance 
between the CGH microarray and Southern blotting 
analyses, transcript array and Northern blotting ana- 
lyses, and proteomic and Western blotting analyses for 
all five genes (Figure I), For example, KCIP-1 showed 
fivefold amplication in five of the six cancer cell lines, 
whereas NFKB1 showed no such increase in any of the 
cell lines. As for transcript expression, Northern blotting 
of EEF1A2 showed high expression in five of the six 
cancer cell lines; again, levels of NFKB1 transcript were 
not increased in any cancer cell line as compared with 
normal bronchial epithelial cells. The results of Western 
blotting were also consistent with the results of the 
proteomic experiments; for example, five of the cancer 
cell lines exhibited strong protein bands for PRDX1 as 
compared with normal cells. These findings provide 
strong support for the validity of the results derived 
from the high-throughput techniques in this study. 

These parallel analyses also revealed close correla- 
tions in the extent of changes in gene copies, transcript, 
and protein of each of the four genes in the cancer cell 
lines. For example, in the five cancer cell lines that 
showed at least fourfold increases in EEF1A2 copy 
number, expression of transcript and protein was also 
increased by at least a factor of four as wel! (relative to 
their expression in normal cells) (Supplementary Figure 
3S). The protein abundance of the four genes showing 
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Table 1 List of 183 genes with statistically significant correlation Table I {continued) 
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Table 1 {continued) 
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corresponding increases in both DNA copy number and 
mRNA provides further evidence that these could be 
oncogenes, the activation of which is reflected by 
genomic amplification and consequent increases in 
transcript level in lung adenocarcinoma cell lines. 

Specific inhibition of EEF1A2 and KCIP*1 expression by 
siRNAs led to decreased cell proliferation and induction of 
apoptosis 

To further prove the oncogenic function of the identified 
genes in lung tumorigenesis, we used siRNAs to inhibit 
the endogenous expression of EEF1A2 and KCIP-1 
protein in four lung cancer cell lines (HI 563, H229, 
H522, and SK-LU). Transfection of the cancer cells with 
specific siRNAs reduced the level of EEFIA2 and 
KCIP-1 protein by 70-90% 48 h after transfection 



(Supplementary Figure 4S). In contrast, EEFIA2 and 
KCIP-I protein levels remained unchanged in mock- 
treated control cells and in cells transfected with a 
scrambled siRNA sequence. At 48 h after siRNA 
transfection, the percentage of proliferation of the 
transfected cancer cells was reduced to 15-30% as 
compared with 91-100% of cell proliferation of the 
same cell lines treated with PBS or scrambled siRNA 
(Supplementary Figure 5S). Apoptosis of siRNA- 
transfected cells was 27-34%, whereas only 4% of the 
same cell lines treated with PBS or scrambled siRNA 
showed apoptosis. These results strongly support an 
oncogenic role for the identified genes in lung cancer and 
confirm their potential usefulness as therapeutic targets 
for the disease. 



Amplification and protein expression of KCIP-1 and 
EEF1A2 in lung tissue 

To further validate these findings and to assess the 
possible clinical significance of the four potential 
putative oncogenes identified from the cell lines, we first 
applied fluorescence in situ hybridization and immuno- 
histochemjcai analysis, in parallel, to commercially 
available human lung tissue microarrays (Ambion , 
Austin, TX, USA) to evaluate the status of two of these 
four genes in lung cancer tissue specimens. (Commer- 
cially available antibodies to PRDXl or CALR were 
not suitable for use in immunohistochemical analysis 
when this report was written.) Overexpression of KCIP- 
1 and EEF1A2 protein in the tumors was concordant 
with amplification of the corresponding genes 
(P = 0.0003 for KCIP-1 and P = 0.0011 for EEF1A2). 
For example, 16 (35%) of the 46 lung adenocarcinomas 
in the microarray showed amplification of KCfP-1, and 
strong cytoplasmic staining for KCIP-1 protein was seen 
in 18 tumors (39%) (Figure 2). We next examined 
whether overexpression of these genes was associated 
with increased cell proliferation by anatysing Ki-67 
expression in contiguous sections of the tissue micro- 
arrays. Positive Ki-67 expression was found to correlate 
with positive expression of both KCIP-1 (/>=0,02) and 
EEF1A2 (/> = 0.01). To extend these findings, we then 
studied 1 1 tissue microarray blocks comprising normal 
and tumor tissue specimens from 113 patients with 
pathologic stage I non-small-cell lung cancer who had 
undergone curative surgery (Wang et aL, 2005). 
Immunohistochemical analysis showed that EEF1A2 
was expressed in 32 cases (28%) and KCIP-1 in 29 cases 
(26%). Univariate and multivariate Cox proportional 
hazards models were used to detect possible associations 
between EEFIA2 and KCIP-1 expression and clinico- 
pathologic variables. Expression of EEFIA2 or KCIP-l 
was associated with short overall survival time 
(^ = 0.0012 for EEF1A2 and ^ = 0.0026 for KCIP-1) 
(Supplementary Figure 6S). Age at diagnosis, histologic 
type of cancer, degree of tumor differentiation, and 
smoking history were not associated with survival time. 

Although only two genes were validated in the lung 
tissue microarrays (because available antibodies to 
the other two genes were not suitable for use in 
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Table 2 Proteins showing significant overexpression in cancer cell Uaes relative to those in normal bronchial epithelial cell lines and their 
correlation coefficients with increased fj>NA copy number or mRNA values* 



Ace. no. 


Gene ID 


Gene 


Mw/pl 


Description 


r with geno* 
mic copy 
changes* 


r with mRNA 
changes* 


Q06830 


5052 


PRDXl 


48.4/5.4 


Peroxiredoxin I 


0.92364 


0.91892 


Q05639 


1917 


EEF1A2 


50.5/5.7 


Eulcaryotic translation elongation factor 1 alpha 2 


0.90218 


0.89456 


P27797 


811 


CALR 


61/5.5 


Calreticulin 


0.84128 


0.86434 


P63KM 


7534 


KCIP-I 


27/6.5 


Tyrosine 3-monooxygenase activation protein, zeta 


0.84467 


0.85499 


P07237 


5034 


P4HB 


54/6.2 


Procollagen-proline, 2-oxoglutarate 4-di oxygenase 


0.91884 


0.76786 


Q04695 


3872 


KRTI7 


48.0/4.9 


Keratin 17 


0.00236 


0.86892 


P092U 


2950 


GSTP1 


23.2/4.7 


Glutathione ^-transferase pi 


0.84218 


0.69456 


P17936 


3486 


IGFBP-3 


31.6/5.8 


Insulin-like growth-factor binding protein 3 


0.06412 


0.16434 


P2664I 


1937 


EEF1G 


50/6.4 


Eukaryotic translation elongation factor 1 gamma 


0.00446 


0.85549 


P08727 


3880 


KRTI9 


44.1/5.2 


Keratin 19 


-0.04884 


0.86786 


P04792 


3315 


HSPB1 


22/6.5 


Heat shock 27 IcDa protein 1 


0.00364 


0.31892 


P00558 


5230 


PGK1 


44.5/4.2 


Phosphoglycerate kinase 1 


0.50402 


0.79456 


Q01995 


5876 


TAGLN 


22.5/4.3 


Transgelin 


-0.34128 


-0.26434 


P0863I 


3055 


JTK9 


59.5/6.8 


Hemopoietic cell kinase 


-0.01446 


0.02549 


P09382 


3956 


LGALSl 


16/5.5 


Galectin-I, galactoside-binding, soluble, 1 


0.026623 


0.01123 


Q92784 


8)10 


DPF3 


25.8/4.8 


D4, zinc and double PHD fingers, family 3 


0.094884 


-0.03214 


P54257 


9001 


HAPI 


75.5/6.5 


Huntington-assoctated protein 1 


0.12364 


-O.08I08 


F05783 


3875 


KRT18 


48/5.3 


Keratin 18 


0.010218 


0.60544 


P05787 


3856 


K.RT8 


9.2/4.4 


Keratin 8 


0.041280 


0.84566 


P00738 


3240 


HP 


55.776.2 


Haptoglobin 


0.044679 


-0.14501 


P09769 


2268 


FGR 


59.5/5.2 


Gardner-R&sheed feline sarcoma viral oncogene homolog 


0.031264 


-0.13789 


P19838 


4790 


NFKB1 


50.4/6.3 


Nuclear factor of kappa light gene enhancer in B-cells 1 


0.04467 


-0.14501 


- P29034" 


6273 


S100A2 


~ 10.9/4.6 


S 100 calcium-binding protein A2 


0.87964 


0.243214 




77HO 


£>DJ Dl / 




Zinc-finger and BTB domain containing 17 


—0.17636 


0.048108 


Q00987 


4193 


MDU2 


75.2/4.8 


Transformed 3T3 cell double minute 2 


-0.19782 


-0.50544 


P27816 


4)34 


MAP4 


1 11/5.4 


Microtubulc-associated protein 4 


0.25872 


-0.05356 


P52732 


3832 


KJFII 


119.2/6.2 


Kinesin family membeT 1 1 


-0.25778 


-0.53444 


P25205 


4172 


MCM3 


90.9/5.5 


fofinichromofiORie mnfntfrnnnr^ HpfirJpnt 1 


n 7^/ULd 


ft 


P08631 


3055 


HOC 


59.5/5.7 


Hemopoietic cell kinase 


0.65533 


0.054501 


P09237 


4316 


MMP7 


22,6/5.8 


Matrix metalloprotetnase 7 


0.234987 


0.876820 


P3030S 


994 


CDC25B 


64.9/4.5 


Cell division cycle 25B 


0.045116 


0.283214 


P50290 


998 


CDC42 


213/6.1 


Cell division cycle 42 (GTP-binding protein, 25kDa) 


-0.47636 


0.088108 


P6IS86 


387 


RHOA 


19.8/6.9 


Ras homolog gene family, member A 


-0.49782 


-0.00544 


P63000 


5879 


RAC1 


21.5/6.8 


Ras-related C3 botulinum toxin substrate I 


-0.05583 


-0.03566 


P07437 


203068 


TUBB 


49.6/6.5 


Tubulin, beta polypeptide 


0.255533 


0.145010 


P24864 


898 


CCNEI 


47.1/4.3 


Cyclin El 


-0.651 !6 


0.232149 


P04I41 


1437 


CSF2 


16.9/6.3 


Colony stimulating factor 2 (granulocyte-macrophage) 


-0.64636 


-1.28108 


P28072 


5694 


PSMB6 


25.3/5.2 


Proteasome (prosome, macropain) subunit, beta type, 6 


-0.69782 


-1.30544 


P0O352 


216 


ALD- 
H1A1 


54.7/4.3 


Aldehyde dehydrogenase 1 family, member Al 


-0.75872 


0.03356 


Q030I3 


2948 


GTM4 


25.3/5.0 


Glutathione ^transferase M4 


-0.78533 


0.134501 


P63241 


1984 


EIF5A 


10/4.4 


Eukaryotic translation initiation factor 5A 


-0.97893 


-1.44321 


Q01469 


2171 


EFABP 


18.0/4.2 


Fatty acid-binding protein 5 


0.25684 


-0.36432 



■Only the gene showing statistically significant increased protein expression with increases in both genomic copy number and transcript 
simultaneously will be considered as potential putative oncogene in lung adenocarcinoma cells. b r f Spearman correlation coefficients between 
proteins and genomic or mRNA values are based on all six canoer cell lines; bold indicates P<0.05, if r> 0.84000. Mw, molecular weight; pf, 
isoelectric point. 




immunohistochemical analysis), these findings are con- 
sistent with those from our cell lines, demonstrating 
again that genomic amplification and consequent 
increases in amounts of transcript may be, at least in 
part, driving the abundance of proteins in these lung 
tumors. The association between expression of these 
genes and that of Ki-67, a known indicator of poor 
prognosis in lung cancer (Martin et at. t 2004), suggests 
that activation of these genes may be an indicator of 
tumor aggressiveness. These results also suggest that 
expression of EEFIA2 and KCIP-I proteins in stage I 
non-smatl-ceil lung cancer may be useful as a marker for 
distinguishing patients with relatively poor prognosis 
from those who might benefit from adjuvant treatment. 



Discussion 

Our current study illustrates the power of integrated 
functional genomic analyses for identifying putative 
oncogenes and for evaluating their potential clinical 
significance. Among the four identified oncogenes, three 
genes {PRDXl, CALR, and KCIP-I) have been im- 
plicated in lung tumorigenesis. PRDXl is an antioxidant 
protein involved in regulating cell proliferation, differ- 
entiation, and apoptosis. Kim et al. (2003) found 
PRDXl expression to be elevated in both lung cancer 
and adjacent normal lung tissue, suggesting that 
activation of PRDXl may enhance proliferation in lung 
cancer. CALR has a major role in Ca 2 * binding and the 
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Figure t Confirmation by Southern, Northern, and Western blot 
analyses of increased DNA copies, transcript levels, and protein 
levels in the four genes identified in high-throughput analyses. For 
comparison, we arbitrarily chose one gene, NFKBI t in which an 
increased protein level did not correlate with genetic changes. The 
blotting results are consistent with the results from the CGH array, 
transcript array, and proteomic analyses. Nor, indicates normal 
bronchial epithelial cell line. All the experiments were repeated at 
least three times with each cell line. Means of normalized to 0-actin 
signal intensities on Southern* Northern, and Western blots, along 
with 95% confidence intervals, were calculated (0-actin signals are 
not shown in the figure; two different normal bronchial epithelial 
cell lines were used in the confirmation and only one normal cell 
line is shown in the figure). 



transcriptional regulation of other genes and was 
recently found to be overexpressed in 73% of 40 lung 
adenocarcinomas (Oates and Edwards, 2000). KCIP-1 
belongs to the 1 4-3-3 family, which participates via the 
MAPK and Wnt signaling pathways in the regulation of 
many cellular processes including cell proliferation and 
differentiation as well as tumorigenesis (Thomas et a/., . 
2005). KCIP~1 was recently found to be expressed in all 
12 lung tumors tested in a single-institution study (Qi 
et aL 9 2005). Interestingly, EEFIA2 was originally 
considered a putative oncogene in ovarian cancer on 
the basis of its being amplified in 25% and over- 
expressed in 30% of the same set of ovarian tumors 
(Anand et a/., 2002); functional analyses have estab- • 
lished its oncogenic role in cellular transformation (Lee, 
2003). Our discovery that EEFIA2 may be a putative 
oncogene in lung adenocarcinoma demonstrates the 
power of our functional genomic strategy for rapidly 
identifying potential oncogenes. 

Although the main focus of this study was to 
specifically identify putative oncogenes, it should be 
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noted that 90.7% of the genes showing high protein 
expression did not show corresponding increases 
in both DNA copy number and transcript, a finding 
consistent with that of others that transcriptional, 
% translationat, and post-translational regulatory mecha- 
nisms can greatly influence the abundance of protein 
in lung tumorigenesis (Chen et a/., 2002). For example, 
NFKBI is a critical arbiter of immune responses, 
cell survival, and transformation and is often activated 
in several types of tumors (Chen et a/., 2002). De- 
regulation of NFKBI is thought to be modulated 
through phosphorylation of Ser337 by protein kinase 
A (Chen et al, 2002). In our study, 68.8% of the 
genes showing over-representation in the genome 
did not show elevated transcript levels, implying 
that at least some of these genes are 'passenger* genes 
that are concurrently amplified because of their 
location with respect to amplicons but lack bio- 
logical relevance in terms of the development of lung 
adenocarcinoma. 

Although the potential oncogenes we identified here 
are likely to be important, certainly other oncogenes 
could be involved in the development of lung adeno- 
carcinoma. The oligo mjcroarray we used consists of . _ 
22 000 probes, which represent only about 60% of the 
human genome. Moreover, each probe was designed for 
the 3' region of expressed sequence tags of the selected 
genes. Also, our results were initially derived from 
cancer cell lines, although the findings were later 
confirmed in human tissue samples. Our ongoing study 
.using microarrays with information on more genes 
[and the development of high-resolution proteomic 
analyses for use with larger numbers of specimens will 
allow more comprehensive analyses of the molecular 
Iconseqnences of gene amplifications. Such expanded 
^analyses will very likely lead to the identification of 
1 additional oncogenes. 

Some of the results of our current study were 
comparable to those of other studies of lung cancer. 
For example, genomic copy number and protein levels 
of KCIP-1 were previously found to be amplified and 
overexpressed in primary lung cancers by cDNA clone- 
based CGH array analysis (Jiang et at., 2004) and 
proteomic analysis (Chen et aL t 2002), respectively. Our 
functional genomic approach, which integrates simulta- 
neous CGH, transcript microarrys, proteomic analyses, 
and siRNA, allows us not only to quickly identify 
potential oncogenes but also to explore their significance 
as diagnostic and therapeutic targets in tumor progres- 
sion - more than could be achieved by any technique 
alone. 

Genes identified in this way may serve as promising 
targets for diagnosis and therapy in lung adenocarci- 
noma. Further research on the clinical implications of 
such genes is needed; experiments now underway in our 
laboratory include overexpression of the genes in 
normal cells, disruption of the function of these genes 
in cancer cells, and investigation of how interactions 
among these genes (or interactions with other known 
oncogenes) may mediate the expression of the trans- 
formed phenotype. 
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Figure 2 EEFIA2 amplification is associated with high EEFIA2 protein expression in lung adenocarcinomas, (a) Cells from a lung 
adenocarcinoma sample in which EEFIA2 is amplified show more green signals (EEFJA2) than red signals (chromosome 20 
centromeric probe) (original magnification, x 400). (b) Immunohistochemical staining of cells from the same tissue sample as in panel 
a shows strong EEFI A2 staining in the cytoplasm, (c) A lung adenocarcinoma sample with two copies otEEFlA2 and chromosome 20 
centromeric probe, indicating no EEFIA2 amplification (original magnification, x 400). (d) Immunohistochemical staining of cells 
from the same tissue sample as in panel c shows negative staining for EEF1 A2. 



Materials and methods 
Cell lines 

Six human lung adenocarcinoma cell lines (H23, H229, HI 792, 
SK-LU-I, H522, and HI 563) were obtained from the 
American Type Culture Collection (Manassas, VA, USA). 
Two normal bronchial epithelial ceil lines were obtained from 
Clontech (Palo Alto, CA, USA). Genomic DNA, mRNA, and 
protein were derived from a single harvest of these cells. 

DNA and RNA profiles by microarray analysis 
Genomic DNA labeling and hybridization were performed as 
described previously (Barrett et aL t 2004) with Agilent's 
Human 1A Oligo Microarray (V2) (Agilent Technologies, 
Palo Alto, CA, USA), which contains 22000 unique 60-mer 
oltgos. Details of the protocol for analysing transcripts are 
available at http://www.chern.agilent.com. Map positions for 
arrayed genes were assigned by identifying the DNA sequence 
represented in the UniGene duster and matching it with the 
Golden Path genome assembly (http://genome.ucsc.edu/; Mat 
7, 2004 Freeze). Microarray images of DNA copy number and 
expression were analysed by using AgilentCGH Analytics and 
Feature Extraction software. DNA copy number profiles that 
deviated significantly from background signal ratios (measured 
from normal control cell hybridization, as described elsewhere; 
Barrett et a!., 2004) were interpreted as evidence of true 
differences in DNA copy number. The criteria for defining 
genomic over-representation and amplicons are described 
elsewhere (Hyman et ai. t 2002); details are given in the 



Supplementary Information. An increase in mRNA level was 
defined as a twofold increase in signal ratio relative to that of 
the control (log 2 > 1). 

Quantitative two-dimensional PAGE and mass spectrometry 
Analysis of proteins by two-dimensional PAGE and their 
identification by mass spectrometry were performed as 
previously described (Shen et a!. t 2004). Briefly, protein pellets 
were solubiiized in rehydration butler, after which the first- 
dimension isoelectric focusing was carried out with a Protean 
IEF Cell (Bio-Rad Laboratories) and the second-dimension 
separation was carried out with Bio-Rad's Ready Gel Precast 
Gels and the Bio-Rad Criterion Cell apparatus. Protein spots 
were visualized by silver-based staining, and all gels were 
assessed with Bio-Rad's PDQuest 2D gel image analysis 
software. Selected spots were subjected to in-gel tryptic 
digestion and analysed on a Voyager-DE PRO matrix-assisted 
laser desorption tonization/time-of-rlight mass spectrometer 
(Applied Biosystems, Foster City, CA, USA). The mass list of 
the 20 most intense monoisotoptc peaks for each sample was 
entered in the MS-Fit search program (v3.2.1) (http:// 
prospector.uwf.edu/ucsfhtml4.0/msfit.htra) and searched in 
the National Center for Biotechnology Information protein 
database. 

Southern, Northern, and Western blot analyses 
Southern, Northern, and Western blot hybridizations were 
performed according to standard protocols. cDNA clones for 
the tested genes were purchased from Invitrogen (Carlsbad, 
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CA, USA) and prepared as probes for the blot hybridizations. 
Antibodies used were obtained as follows: PRDX1, CALR, 
NFKB1, KCIP-l, and £-actin from Santa Cruz Biotechnology 
(Santa Cruz, CA, USA); and EEF1A2 from Upstate Biotech- 
nology (Walthara, MA, USA). 

Fluorescence in situ hybridization and immunohistochemical 
analyses of lung tissue microarrays 

Fluorescence in situ hybridizations and immuaohistochemical 
analyses of KCIP-1 and EEF1A2 were carried out as described 
elsewhere (Jiang et aL % 2002; Wang et aL, 2005) with Lung 
Tissue Microarrays (Ambion, Austin, TX, USA) and 11 
homemade microarray blocks containing tissue samples from 
113 patients with pathologic stage I non-small-cell lung cancer 
(Wang et aL t 2005). DNA probes specific for KCIP-I and 
EEFJA2 were obtained by screening a Human BAC Clone 
library (Invitrogen) by polymerase chain reaction as described 
previously (Jiang et aL, 2002). The antibodies used for the 
immunohistochemical analyses were the same as those used 
for the Western blotting. Cell proliferation of the lung tissues 
was assessed with a Ki-67 monoclonal antibody from Santa 
Cruz Biotechnology. Definitions of the cutoff value for a 
positive result of each antibody are shown in Supplementary 
Information. 

siRNA transfection, cellular proliferation assay, and apoptosis 
analysis 

Trausfections were carried out by using siPORT Lipid 
Transfection Agent (Ambion) with siRNAs targeting KCIP-l 
or EEFIA2 or with a scrambled siRNA duplex (siControI) 
(Dharmacon Inc., Lafayette, CO, USA), with PBS used as a 
negative control (Jiang et aL, 2002). Cells were fixed 24, 48, or 
96 h later and subjected to further tests. Ail siRNAs were 
prepared by using a transcription-based method with Silencer 
siRNA according to the manufacturer's instructions (Am- 
bion). Sequences of the individual siRNAs are listed in 
Supplementary Table 4S. Inhibition of .cell growth by the 
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siRNAs was determined by MTT staining, and cell growth 
rate was plotted against the percentage of viable cells in the 
saline-treated controls (a value arbitrarily set at 100%) (Jiang 
et al. t 2002). Apoptosis was analysed by fluorescence cell 
cycle analysis of terminal deoxynucleotidyl transferase- 
mediated dUTP nick-end labeling with FITC-labeled dUTP 
(Boehringer Mannheim Biochemicals, Mannheim, Germany) 
(Jiang et aL, 2005). 

Statistical analyses 

Relationships between gene copy number and mRNA level 
were examined as described elsewhere (Hyman et ai t 2002, 
Supplementary Information). Correlations between protein 
abundance and DNA copy number and mRNA expression of 
the corresponding genes were evaluated with the Spearman 
correlation coefficient. Fisher's exact test and x 2 -tests were 
used to analyse associations between amplification and 
expression of the candidate genes with various histopathologic 
variables of the samples in the tissue microarrays. Univariate 
and multivariate analyses were carried out with Cox's 
proportional hazards model to determine which independent 
factors might have a joint significant influence on survival. A 
lvalue ^0.05 was considered statistically significant; all 
statistical tests were based on a two-sided significance level. 
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ABSTRACT The consistent cytogenetic tram cation of 
chronic myelogenous leukemia (the Philadelphia cl; mosome, 
Ph 1 ) has been observed in cells of multiple hematopoietic 
lineages. This translocation creates a chimeric ge- composed 
of breakpoint-cluster-region (bcr) sequences from -omosome 
22 fused to a portion of the abl oncogene on chromo me 9. The 
resulting gene product (P210 c ,bl ) resembles the i usformlng 
protein of the Abelson murine leukemia virus in its structure 
and tyrosine kinase activity. P210 c "* bl is expressed In Ph 1 - 
positive cell lines of myeloid lineage and In clinical specimens 
with myeloid predominance. We show here that Epstein-Barr 
virus-transformed B-lymphocyte lines that retain Ph 1 can 
express P210 C Bbl . The level of expression in these B-ceil lines is 
generally lower and more variable than that observed for 
myeloid lines. Protein expression is not related to amplification 
of the abl gene but to variation In the level of bcr-abl mRNA 
produced from a single Ph 1 template. 



Chronic myelogenous leukemia (CML) is a disease of the 
pluripotent stem cell (1). In greater than 95% of patients, the 
leukemic cells contain the cytogenetic marker known as the 
Philadelphia chromosome, or Ph 1 (2). This reciprocal 
translocation event between the long arms of chromosomes 
9 and 22 has been used as a disease-specific marker for 
diagnosis and evaluation of therapy. Multiple hematopoietic 
lineages, including myeloid and B-lymphoid, contain Ph 1 in 
early or chronic phase, as well as in the more acute accel- 
erated and blast crisis phases of the disease. 

One molecular consequence of Ph 1 is the translocation of 
the chromosomal arm containing the c-abl gene on chromo- 
some 9 into the middle of the breakpoint-cluster region (bcr) 
gene on chromosome 22 (3-6). Although the precise 
translocation breakpoints are variable, an RNA-splicing 
mechanism generates a very similar 8-kilobase (kb) mRNA in 
each case (5-9). The hybrid bcr-abl message encodes a 
structurally altered form of the abl oncogene product, called 
P210 c * abI (10-13), with an amino-terminal segment derived 
from a portion of the exons of bcr on chromosome 22 and a 
carboxyl-terminal segment derived from a major portion of 
the exons of the c-abl gene on chromosome 9. The chimeric 
structure of bcr-abl and the resulting P210 c " flbI is similar to the 
structure of the Abelson murine leukemia virus gag-abl 
genome and resulting P160 V Bbl transforming gene product. 
Both proteins have very similar tyrosine kinase activities (10, 
11, 14) which can be distinguished by their relative stability 
to denaturing detergents and by their ATP requirements from 
the recently described tyrosine kinase activity of the c-abl 
gene product (15). 
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In concert with structural modification of the amino- 
terminal portion of the abl gene, increased level of expression 
has been implicated in activation of c-abl oncogenic poten- 
tial. Myeloid and erythroid cell lines and clinical samples 
derived from acute-phase CML patients contain about 10- 
fold higher levels of the 8-kb bcr-abl mRNA and P210 CBbl than 
the c-abl mRNA forms (6 and 7 kb) and P145 c abl gene product 
(5, 8, 9, 11). The higher level of expression of the chimeric 
bcr-abl message in acute-phase cells is not likely to be solely 
due to the presence of the bcr promoter sequences at the 5' 
end of the gene, since the normal 4.5-kb and 6.7-kb bcr- 
encoded mRNA species are expressed at an even lower level 
than the normal c-abl messages (5, 6). 

We have analyzed a series of Epstein-Barr virus-immor- 
talized B-lymphoid cell lines derived from CML patients (16). 
With such in vitro clonal cell lines, we can evaluate whether 
the presence of Ph 1 always results in synthesis of the chimeric 
bcr-abl message and protein, and whether the quantitative 
expression varies for cells of B-lymphoid lineage as com- 
pared to previously examined myeloid cell lines. Our results 
show that cell lines that retain Ph* do express bcr-abl message 
and protein, but that the level is generally lower and more 
variable than previously seen for myeloid cell lines. The 
demonstration that the Ph 1 chromosomal template can vary 
in its level of expression of P210 cabl suggests that secondary 
mechanisms, beyond the translocation itself, contribute to 
the regulation of the bcr-abl gene in different cell types or 
subclones that derive from the affected stem cell. 

MATERIALS AND METHODS 

Cells and Cell Labelings. Epstein-Barr virus-transformed 
B-lymphoid cell lines were established from peripheral blood 
samples of chronic- and acute-phase CML patients as report- 
ed (16). The cell lines are designated according to patient 
number, karyotype, and lineage. For example, SK- 
CML7Bt(9,22)-33 refers to CML patient 7, B-lymphoid ceil 
line, 9;22 translocation (Ph 1 ), cell line 33; and SK-CML7BN- 
2 refers to B-cell line 2 with a normal karyotype derived from 
the same patient. Repeat karyotype analysis was performed 
to verify the retention of Ph 1 just prior to analysis for abl 
protein and RNA. Cells were maintained in RPMI 1640 
medium with 20% fetal bovine serum. We have not observed 
any consistent pattern of in vitro growth rate that correlates 
to the stage of disease at the time of transformation with 
Epstein-Barr vims. Cells (1.5 x 10 7 ) were washed twice with 
Dulbecco's modified Eagle's medium lacking phosphate and 



Abbreviations: bcr, breakpoint-cluster region; CML, chronic 
myelogenous leukemia; kb, kilobasc(s). 
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populations thai lack the Ph 1 marker, supports a secondary 
or complementary role for Ph 1 in the progression of the 
disease (24, 25). This chromosome marker is found in 
chronic, accelerated, and blast-crisis phases of the disease. It 
is likely that Ph 1 confers some growth advantage, since cells 
with the marker chromosome eventually predominate the 
marrow and peripheral blood even in chronic phase. During 
the phase of blast crisis, many patients develop additional 
chromosome abnormalities, including duplication of Ph 1 , a 
variety of trisomies, and complex translocations (26). This 
is suggestive evidence for Ph 1 being a necessary but not 
sufficient genetic change for the full evolution of the 
disease. 

The realization that one molecular result of Ph 1 is the 
generation of a chimeric bcr-abl protein with functional 
characteristics and structure analogous to the gag-abl trans- 
forming protein of the Abelson murine leukemia virus 
strengthens the argument for an important role of Ph 1 in the 
pathogenesis of CML. Although the Abelson virus is gener- 
ally considered a rapidly transforming retrovirus, its effects 
can range from overcoming growth factor requirements, to 
cellular lethality, to induction of highly oncogenic tumors in 
a number of hematopoietic cell lineages (27, 28). Even in the 
transformation of murine cell targets, there are several lines 
of evidence that suggest that the growth-promoting activity of 
the v-abl gene product is complemented by further cellular 
changes in the production of the malignant-cell phenotype 
(29-31). 

The regulation of bcr-abl gene expression is complex 
because the 5' end of the gene is derived from the non-abl 
sequences, bcr, normally found on chromosome 22 (6). The 
level of stable message for the normal bcr gene and the 
normal abl gene are both much lower than the level of the 
bcr-abl message and protein from cell lines and clinical 
specimens derived from myeloid blast-crisis patients (5, 6, 
11). Therefore, the high level of bcr-abl expression cannot 
simply be attributed to the regulatory sequences associated 
with bcr. Possibly, creation of the chimeric gene disrupts the 
normal regulatory sequences and results in a higher level of 
expression. Variation in bcr-abl expression may result from 
secondary changes in the structure of the chimeric gene or 
function of fra/u-acting factors that occur during evolution of 
the disease. Our analysis of P210 c - lbl and the 8-kb mRNA in 
Epstein-Barr virus-transformed Prepositive B-cell lines 
demonstrates that stable message and protein levels from the 
bcr-abl gene can vary over a wide range. This variation does 
not result from a change in the number of bcr-abl templates 
secondary to gene amplification but more likely from changes 
in either transcription rate or mRNA stability. We suspect 
this range of bcr-abl expression is not limited to lymphoid 
cells. Analysis of peripheral blood leukocytes derived from 
an unusual CML patient who has been in chronic phase with 
myeloid predominance for 16 years showed a level of 
P210 c abl one-fifth that of P145 cabf , as detected by metabolic 
labeling with [ 32 P]orthophosphate and immunoprecipitation 
(S.C., O.N.W., and P. Greenberg, unpublished observa- 
tions). Lower levels of expression of the chimeric mRNA 
have been demonstrated in clinical samples from chronic- 
phase CML patients compared to acute-phase CML patients 
(9). Others have reported chronic-phase patients with vari- 
able but, in some cases, relatively high levels of the bcr-abl 
mRNA (32). The sampling variation and the heterogenous 
mixture of cell types in clinical samples complicate such 
analyses. Further work is needed to evaluate whether there 
is a defined change in P210 cabl expression during the pro- 
gression of CML. It is interesting to note that among the 
limited sample of Ph l -positive B-cell lines we have examined 
(Table 1), we have seen higher levels of P210 c abt in those 
derived from patic " at more advanced stages of the disease. 



It will be important to search for cell-type-specific mecha- 
nisms that might regulate expression of bcr-abl from Ph 1 . 
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Abstract; Epithelial tumors develop through a multistep process driven by 
genomic instability frequently associated with etiologic agents such as pro- 
longed tobacco smoke exposure or human papilloma virus (HPV) infection. 
The purpose of the studies reported here was to examine the nature of genomic 
instability in epithelial tissues at cancer risk in order to identify tissue genetic 
biomarkers that might be used to assess an individual's cancer risk and 
response to chemoprcventivc intervention. As part of several chemoprevention 
trials, biopsies were obtained from risk tissues (i.e., bronchial biopsies from 
chronic smokers, oral or laryngeal biopsies from individuals with premalig- 
nancy) and examined for chromosome instability using in situ hybridization. 
Nearly all biopsy specimens show evidence for chromosome instability 
throughout the exposed tissue. Increased chromosome instability was observed 
with histologic progression in the normal to tumor transition of head and neck 
squamous cell carcinomas. Chromosome instability was also seen in prernatig- 
nant head and neck lesions, and high levels were associated with subsequent 
tumor development. In bronchial biopsies of current smokers, the level of 
ongoing chromosome instability correlated with smoking intensity (e.g., 
packs/day ), whereas the chromosome index (average number of chromosome 
copies per cell) correlated with cumulative tobacco exposure (i.e., pack-years). 
Spatial chromosome analyses of the epithelium demonstrated multifocal clonal 
outgrowths. In former smokers, random chromosome instability was reduced; 
however, clonal populations appeared to persist for many years, perhaps 
accounting for continued lung cancer risk following smoking cessation. 

Keywords: chromosome instability; epithelial cells; aerodigestive tract; 
chemoprevention; cancer risk 



THE NEED FOR BIOMARKERS OF CANCER RISK AND 
RESPONSE TO INTERVENTION 

Epithelial cancers remain a major health challenge in the world. Despite improve- 
ments in staging and the application and integration of surgery, radiotherapy, and 
chemotherapy, the 5-year survival rate for individuals with lung cancer is only about 
15%.' Even if strategies for early detection are successful and lung cancers 
are detected at a stage where local tumor resection and treatment is curative, 
these patients will still be at significant risk for developing second primary tumors 
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associated with the problem of field cancelation. 2 Similarly, for individuals with a 
first head and neck primary tumor, even if the first malignancy is successfully treat- 
ed, the risk of developing a second primary in the tobacco smoke-exposed field is 
approximately 40%. 3 Similar cancer risk estimates exist for individuals who exhibit 
severe dysplasia in premalignant epithelial lesions. 4 For these reasons, it is important 
to focus on chemopreventive strategies to prevent the development of epithelial 
malignancies. 

Several problems confront chemoprevention trials designed to identify effica- 
cious agents. 5 First, chemoprevention trials with cancer incidence as a primary end- 
point require tens of thousands of subjects and tens of years of intervention and 
follow-up for statistical evaluation. For example, a recendy reported trial involved 
30,000 subjects and required 10 years in order to examine the impact of prevention 
strategies on lung cancer development, only to find a possible increased lung cancer 
incidence in current smokers who received [3-carotene. 6 

The problem of large, long-term trials results from the difficulty in identifying 
individuals at highest cancer risk who might best benefit from chemopreventive 
intervention. For example, 20 pack-year smokers, while known to be at relatively 
increased risk for developing lung cancer, have approximately a 10% lifetime risk 
for developing lung cancer/ This seriously limits the number of potentially useful 
strategies that can be clinically explored. A second problem facing chemoprevention 
trials is that little is known about what agents are likely to have efficacy, and even 
less is known regarding proper doses, schedules, and durations of treatment. Part of 
the reason for this problem is that too little is known about the physiologic processes 
that drive epithelial cancer development. 

In order to reduce the number of subjects and the time required to carry out 
chemoprevention trials and thus allow the exploration of multiple prevention strate- 
gies, two types of advances are necessary. First, it is important to identify individuals 
at significantly increased cancer risk who might best benefit from different types of 
intervention. Second, in order to allow the rapid identification of agents, doses, and 
schedules of potentially efficacious agents, it is necessary to identify and validate 
surrogate endpoints of response that indicate whether the agents are having a posi- 
tive impact on the target tissue during the chemopreventive intervention. 

One approach to identifying individuals at increased aerodigestive tract cancer 
risk is to explore epidemiologic features of potential subjects. Molecular epidemio- 
logic studies are beginning to identify intrinsic host factors that place some individ- 
uals at increased cancer risk, especially those with a chronic smoking history. 8 Most 
intrinsic factors identified thus far reflect levels of carcinogen metabolism, repair 
capabilities of the host following DNA damage, and other measures of intrinsic 
cellular sensitivity to mutagens. While these factors can provide statistically signif- 
icant risk ratios in case-control studies that are controlled for tobacco exposure, the 
detected risk ratios usually fall in the range of 1.5 to 10. Unfortunately, this is not 
sufficient for the individualization of treatment and is not sufficiently high to signif- 
icantly reduce the numbers of subjects required for chemoprevention trials with 
cancer incidence as the primary endpoint. 

Another approach to identifying individuals at increased cancer risk is to directly 
examine the target tissue of individuals with known carcinogen exposure (e.g., 
chronic tobacco smoke exposure), who have evidence of target organ dysfunction 
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(e.g., chronic obstructive pulmonary disease, changes in voice quality), or who 
have clinical evidence of premalignancy (e.g., bronchial metaplasia/dysplasia, oral 
leukoplakia/erydiroplakia, cervical intraepithelial neoplasia). The conventional 
standard for assessing cancer risk in these situations is the degree of histological 
change. However, while individuals who show moderate to severe dysplasia are 
known to be at increased cancer risk when compared to individuals with lesser his- 
tologic changes, it is often difficult to distinguish reactive changes to carcinogenic 
insult from initiated and progressing lesions. Similarly, upon cessation of carcino- 
genic insult, histologic changes may reverse yet cancer risk may continue for many 
years. For example, while smoking cessation is associated with decreased bronchial 
metaplasia, 9 increased lung cancer risk continues for many years beyond smoking 
cessation. 10 In fact, nearly half the newly diagnosed lung cancer cases in the USA 
occur in former smokers.* 1 

The development of assays to identify individuals at high epithelial cancer risk 
and to directly assess response to intervention in the target tissue is therefore an 
important research goal. Such assays should be objective and easily quantifiable and, 
if possible, minimally invasive. Moreover, they should reflect both the disease pro- 
cess and the targeted pathway and thereby be useful in assessing risk and monitoring 
response to intervention as well as directly testing the hypothesized mechanism of 
action of the chemopreventive strategy. 

In the chemoprevention setting it is important to recognize that one does not 
know the location of the future cancer. Thus, assays must necessarily be carried out 
on random biopsies of the field at risk. Even if (here are clinically evident premalig- 
nant lesions, this does not mean that this is the likely site for a future malignancy. 
For example, nearly half of the cancers thai develop in individuals with oral leuko- 
plakia arise away from the original index lesion. Similarly, since many newly diag- 
nosed lung cancers arise in the peripheral parts of the lung (e.g., adenocarcinomas), 
especially in former smokers, and since endobronchoscopy predominantly accesses 
central components of the lung, it is important to identify biomarkers that can reflect 
global processes ongoing in the target epithelial field associated with increased can- 
cer risk. Their discovery requires a better understanding of the tumori genesis pro- 
cess in epithelial fields at cancer risk. 



THE RATIONALE FOR STUDYING 
GENOMIC INSTABILITY AS A MARKER OF RISK 

Tumors of the aerodigestive tract have been proposed to reflect a "field canceriza- 
tion" process whereby the whole tissue is exposed to carcinogenic insult (e.g., tob- 
acco smoke) and is at increased risk for mukistep tumor development. 12,13 Several 
types of clinical and laboratory data support this notion, including the frequent 
occurrence of synchronous primary and subsequent second primary tumors in the 
aerodigestive tract (frequently exhibiting dissimilar histologies as well as distinct 
genetic signatures 14-16 ) and the presence of premalignant lesions that precede and/or 
accompany the tumor in the exposed tissue field. 1 ' The notion of a multistep tumor- 
igenesis process-is further supported by serial clinical and histologic evaluations of 
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target tissue or exfoliated cells where increasing degrees of histological abnormali- 
ties are observed over time. 18 

A working model for aerodigestive tract tumorigenesis is illustrated in Figure 1 . 
Tumorigenesis in the face of carcinogenic exposure likely involves a chronic process 
of tissue injury and wound healing. DNA damage induced by the carcinogen is likely 
fixed into permanent genetic changes (e.g., chromosome damage, chromosome non- 
disjunction, gene mutation, gene deletion, etc.) during the process of proliferation. 
This damage would be expected to be distributed throughout the exposed tissue field 
leading to a background of generalized genomic damage (depicted in Figure 1 as a 
background mat of increasing density). Chronic injury and repair likely leads to the 
accumulation of cells with increasing amounts of genetic changes as well as the out- 
growth of abnormal clones (triangles in Figure I) carrying an accumulation of 
genetic changes important for selective survival, dysregulated growth, and preferen- 
tial epithelial take-over by initiated clones (see Figure 2). 

Cellular and molecular evidence for the field carcinogenesis and multistep tum- 
origenesis model comes from many laboratories. 19,20 With the advent of a wide array 
of molecular technologies, a large number of specific molecular genetic and epi ge- 
netic changes involving specific oncogenes, tumor suppressor genes, cell regulatory 
genes, and repair genes have now been described for aerodigestive tract cancers. The 
identification of these specific molecular changes have now provided probes to 
explore specific events occurring in premalignant lesions adjacent to aerodigestive 
tract tumors. 21 " 24 Frequently, these premalignant lesions showed a subset of the 
same molecular changes found in the associated tumor, suggesting that these lesions 
might represent precursor lesions for the associated tumors (i.e., a manifestation of 
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FIGURE I. Field cancerization and multistep tumorigenesis. 
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FIGURE 2. Multiple focal clonal evolution during multistep turn ori genesis. 



a multistep tumorigenesis process). For example, studies of the premalignant lesions 
adjacent to head and neck tumors have provided evidence for a gradual accumulation 
of genetic alterations accompanied by evidence for dysregulation of cellular control 
mechanisms (e.g., alterations in expression of PCNA, EGFR, TGF-p, p53, and 
cyclinDl). 25 " 28 

These types of studies have now also been applied to the target epithelium of indi- 
viduals at increased risk for aerodigestive tract cancer (i.e., individuals with a chron- 
ic smoking/alcohol history and/or prior aerodigestive tract cancer). Several groups 
(using polymerase chain reaction, PCR, analysis of microdissected epithelium) have 
now demonstrated the presence of clonal outgrowths in the target premalignant epi- 
thelium of individuals at increased risk for cancer. 29 " 31 For example, examination of 
bronchial biopsies derived from individuals with a 20 pack-year smoking history 
demonstrated that 76% of the cases showed evidence for LOH (3pl4, 9p21, or 
1 7p 1 3) in at least one of six lung biopsy sites. On a per site basis, some form of LOH 
was observed in 25% of the sites examined. 29 

If aerodigestive tract cancer development reflects a field cancerization process 
involving multistep events, then risk and response information should be able to be 
derived from random biopsies or exfoliated cells from the field at risk or from assess- 
ments of tissue undergoing simitar processes. Hypothetically, lesions exhibiting the 
greatest degree of genomic instability, clonal outgrowth, and abnormal epithelial 
regulation would be at the highest relative aerodigestive tract cancer risk. Similarly, 
an active chemopreventive intervention might be expected to decrease these mani- 
festations of risk. Reduced risk manifestations include decreased levels of ongoing 
genetic instability, decreased frequency of clonal outgrowths, and increased epithe- 
lial growth regulation. 
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THE MEASUREMENT OF CHROMOSOME INSTABILITY USING 
CHROMOSOME IN SITU HYBRIDIZATION 

Molecular genetic techniques, while extremely useful for detecting clonal chang- 
es in targets tissues, are somewhat limited in their ability to detect random genetic 
instability. Conventional cytogenetic assays are useful for detecting chromosome 
instability and clonal chromosome changes. However, they require numbers of 
dividing cells for karyoiypic analysis that are difficult to attain in the setting of biop- 
sies acquired during the course of a chemoprevention trial. A technique was there- 
fore needed that would allow chromosome instability measurements in situations 
where few cells are available (e.g. small biopsies, brushings, or sputum samples) and 
where the target material might be fixed. It was also desirable to have a technique 
that would be adaptable to tissue sections, whereby spatial information could be 
retained and genotype/phenotype associations could be determined on the same or 
adjacent tissue sections. The technique of in situ hybridization (ISH) involves the 
use of DNA probes that recognize either chromosome-specific repetitive target 
sequences, chromosome single gene copy sequences, or sequences along the whole 
chromosome length or chromosome segments. 32 We have adapted the ISH technique 
for fonnal in-fixed, paraffin -embedded tissue sections and have applied it to a variety 
of tissues, including the aerodigestive tract.** 3,34 

Using probes that label the centromere regions uf specific chromosomes, this, 
assay permits determination of the average chromosome number per cell for each 
specimen. This assay is also useful for detecting generalized chromosome instability 
during the tumorigenesis process. Normal diploid populations should have two cop- 
ies of each autosomal chromosome and should rarely show three or more chromo- 
some copies per cell (chromosome polysomy), especially in tissue sections where 
nuclear truncation results in an under-representation of chromosome copy number. 
Thus, the detection of cells with three or more chromosome copies would indicate 
the presence of chromosome instability. 

To examine this technique's potential for characterizing the multislep tumorigen- 
esis process in the aerodigestive tract, we measured the fraction of cells exhibiting 
three or more chromosome copies in apparently contiguous epithelial transitions 
from normal to hyperplastic to dysplastic to carcinomas, all on a single tissue slice 
of head and neck squamous cell carcinomas. 34 In these specimens, greater than 35% 
of the cases of adjacent "normal" epithelium, greater than 65% of the cases of hyper- 
plastic epithelium, and greater than 95% of the dysplastic and tumor regions showed 
evidence of chromosome polysomy. Of interest, similar transitions of chromosome 
instability were observed with at least four different chromosome probes. Similar 
trends have also been observed in amenable tissue from other epithelial malignan- 
cies, including cervix, bladder, and breast. 35 These results thus suggested that the 
notions of field cancerization and multistep tumorigenesis might apply to several 
epithelial tissues and that measures of chromosome instability might be useful for 
monitoring this process. 

In the situations described above, the premalignant lesions examined might be 
considered to represent epithelium at 100% risk of being in a cancer field, since they 
were located in the adjacent epithelium to the cancer. This then raises the question 
of the nature of genetic instability in the epithelium of individuals at increased risk 
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for developing cancer. To explore this issue, we obtained biopsies during the course 
of leukoplakia chemoprevention trials exploring the use of 1 3-cw-retinoic acid in 
reversing leukoplakia and probed them for genetic instability using in situ hybridiza- 
tion. In one retrospective study and in one prospective study of subjects with oral 
leukoplakia, the results indicate that those subjects whose pretreatment biopsies har- 
bor relatively high levels of genomic instability (i.e., more than 3% of the cells 
examined showing at least 3 chromosome 9 copies per cell) have a significantly 
higher likelihood of suffering early onset of head and neck cancer. 36,37 Interestingly, 
half of the tumors that did develop occurred away from the biopsy site used to mea- 
sure genetic instability. This result suggests that genomic instability measurements 
in c arc inogen-ex posed tissue can provide useful cancer risk estimates. 



THE RELATIONSHIP BETWEEN TOBACCO EXPOSURE AND 
CHROMOSOME INSTABILITY 

In recent years, the aerodigestive tract chemoprevention group at M.D. Anderson 
Cancer Center has initiated three sequential biomarker-associated chemoprevention 
trials involving chronic smokers with a greater than 20 pack-year smoking history. 
In each of these studies, endobronchial biopsies were obtained from six defined sites 
within the lung, including the carina and at bifurcation points at the upper, middle, 
and lower right lung and at the upper and lower left lung. Biopsies were obtained pri- 
or to and following chemopreventive intervention and were subjected to in situ 
hybridization analysis in addition to analyses for other biomarkers. The first impor- 
tant finding was that some degree of chromosome polysomy was evident in all lung- 
sites examined, and this was observed independently of the particular chromosome 
probe utilized. 38 This finding supports the notion that random chromosome changes 
may be occurring throughout the exposed lung field. 

In a second study, bronchial biopsies were obtained from individuals with a 20 
pack-year smoking history. In this study, most of the subjects involved were current 
smokers. 39 Interestingly, all cases who showed metaplasia at one of six biopsy sites 
also showed chromosome polysomy in at least one biopsy site; overall, 88% of the 
sites showed some evidence of chromosome 9 polysomy, 40 Evidence for genetic 
instability was also detected in patients who did not show evidence of bronchial 
metaplasia in any of six biopsy sites despite a strong smoking history. In fact, more 
than 90% of the cases and more than 60% of the sites showed significant chromo- 
some polysomy (i.e., ai least three copies in at least 2 % of the cells examined). 
These results suggest that the lungs of long-term smokers show significant evidence 
of genetic instability, and this instability can be detected throughout the accessible 
bronchial tree, even when bronchial metaplasia is not evident. 

These studies in current smokers has allowed us to examine the relationship 
between the levels of genetic instability detected and subject characteristics such as 
smoking status (current or former), smoking history, and lung tissue pathologic 
changes. Evaluable biopsy material has now been obtained from more than 108 cur- 
rent smokers, including more than 480 evaluable biopsy sites. The mean metaplasia 
index in these current smokers was 30.4%. For the total population studied, the 
median chromosome index for the bronchial biopsies was 1.41 (range, 1.04-1.61) 
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and the median chromosome polysomy index was 2.0% (range 0-8.7%). This can be 
compared to a mean chromosome index between 1.2-1.4 for lymphocytes and very 
rare chromosome polysomy. Interestingly, the intrasubject variability in chromo- 
some instability was relatively low in most subjects and was less than the intersub- 
ject variability. These results suggested that chronic smokers harbor detectable 
chromosome instability throughout the accessible bronchial tree (supporting the 
field carcinogenesis notion) and that information from one biopsy site might yield 
representative information for the rest of the lung field. 

Since most of the current smokers exhibited bronchial metaplasia in at least one 
of the biopsied sites, this allowed us to examine the relationship between chromo- 
some instability and histologic changes, both on a site-by-sitc basis and on a per case 
basis. On a site-by-site basis, the chromosome indices of lesions showing squamous 
metaplasia were similar to those not showing metaplasia (i.e., median 1.43 vs. 1.43), 
and the degree of chromosome polysomy in metaplastic lesions were only slightly 
higher than in non-metaplastic sites (medians: 2.2% vs. 1 .8%, respectively). Thus, 
the presence or absence of squamous metaplasia at a biopsy site does not necessarily 
correlate with the degree of underlying genomic instability. On the other hand, those 
subjects with metaplasia indices of at least 15% also showed higher levels of chro- 
mosome polysomy than did subjects with metaplasia index below 15% (medians: 
2.4% vs. 1 .8%, p - 0.005). Thus, these chromosome instability assessments in cur- 
rent smokers appeared to reflect a more global process in the lung field. 

Tobacco exposure has been shown to significantly increase the risk of developing 
lung cancer, and the degree of risk is related to the extent of tobacco exposure. We 
were interested in determining the relationship between individuals' smoking histo- 
ry parameters and the levels of chromosome change found in their lungs following 
years of tobacco exposure. While there was significant intersubject variation for sim- 
ilar tobacco exposure histories, overall there was a significant correlation between 
the degree of chromosome polysomy and the intensity of ongoing tobacco exposure 
(packs/day, p = 0.02 on a per site basis) and with the extent of tobacco exposure 
(pack-years, p = 0.003). Thus the amount of chromosome polysomy reflects the 
intensity and extent of tobacco exposure. At the same time, individuals with similar 
smoking histories showed widely divergent amounts of chromosome polysomy, pos- 
sibly reflecting differences in intrinsic sensitivity between subjects. There was also 
strong correlation between the chromosome index and the duration of the smoking 
history (smoking years) and total accumulated exposure (pack-years, p = 0.0001). 
These results suggest that tobacco exposure is associated with the initiation and 
accumulation of chromosome instability in the exposed lung; however individuals 
are differentially sensitive to carcinogenic insult. The working hypothesis is that 
those individuals who accumulate the highest degree of chromosome changes will 
be at the highest lung cancer risk. 

Many of the bronchial biopsies from chronic smokers examined by in situ hybrid- 
ization showed a rise in the chromosome index above that expected for a diploid cell 
population, especially in subjects with an extensive smoking history. The rise in 
chromosome index was also accompanied by an increase in the fraction of cells 
exhibiting at least 3 chromosome copies per cell. To determine if a rise in the tissue 
chromosome index was due to clonal expansion of populations with chromosome tri- 
somy, the chromosome copy number and relative coordinates of each cell scored in 
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the bronchial epithelium was recorded and a spatial genetic map was created. 4 * We 
then developed algorithms for calculating localized chromosome indices within the 
tissue. Since trisoinic clones would have, on average, three chromosomes instead of 
two, those ceils involved in neighborhoods with chromosome indices three-halves 
that of diploid populations could be marked as being part of a trisomic clone. Simi- 
larly, groups of cells with chromosome indices half that of diploid populations could 
be marked as being part of a monosomic clone. This allowed the generation of a sec- 
ond-order, two-dimensional genetic map representation of the bronchial epithelium 
showing the relative locations of cells involved in monosomic and trisomic clonal 
outgrowths. When adjacent tissue sections from the same bronchial biopsy were 
probed separately for different chromosomes, the detected clones appeared to occu- 
py separate subregions of the epithelium. This result suggests that not only are the 
lungs of chronic smokers undergoing a process of genetic instability, they are expe- 
riencing the outgrowth of multiple clones throughout the exposed lung field, as pos- 
tulated by the models shown in FIGURES I and 2. One advantage of this clonal 
approach is that the contribution of both monosomic and multisomic clones can be 
detected. 

Since smoking cessation has been suggested to reduce the lung cancer risk, it was 
of interest to determine whether the levels of chromosome instability would decrease 
following smoking cessation. This question was possible to examine because our 
third sequential chemoprevention trial involved subjects who had discontinued 
smoking. So far, more than 220 subjects (more than 650 biopsies) who have quit 
smoking (mean 9.9 quit-years) have been evaluated for chromosome instability in 
their lungs. Despite the fact that the mean metaplasia index in this group is 5.8% 
(considerably less than that in current smokers), chromosome instability is still 
observed in the majority of subjects. 42 While the mean chromosome polysomy level 
is reduced to 1.0%, some individuals continue to show polysomy levels above 5%. 
Interestingly, while the overall chromosome polysomy levels were reduced in these 
individuals who stopped smoking, the mean chromosome index remained at about 
1 .4 with some individuals exhibiting chromosome indices as high as 1.8. Initial chro- 
mosome mapping studies suggest that while random chromosome instability seems 
to decrease following smoking cessation, the clonal outgrowths may remain for 
many years in the lung. The working hypothesis is that those individuals who show 
the greatest degree of remaining chromosome instability are at the highest lung can- 
cer risk despite smoking cessation. Long-term follow-up on these subjects will be 
necessary to test this hypothesis. 



SUMMARY AND CONCLUSIONS 

Acrodigestivc tract tumorigenesis appears to be a multistep process taking place 
throughout the tissue fields of exposure. When viewed in the context of chromosome 
changes, carcinogen exposure appears to be associated with the random acquisition 
of chromosome polysomy throughout the exposed field, the degree of which is relat- 
ed to the degree and extent of carcinogen exposure as well as to the instrinsic suscep- 
tibility of the exposed individual. Continued exposure leads to continued acquisition 
of new changes and, in association with chronic wound-healing processes, to the 
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accumulation of clonal outgrowths throughout the target tissue. Although the ulti- 
mate malignancy may occur in only one or few tissue sites, manifestations of the 
instability process that drives tumori genesis is globally present in the tissue. Thus 
random biopsies may provide useful risk information for the exposed field as a 
whole. Even when carcinogen exposure is reduced or chemopreventive strategies are 
initiated and histologic manifestations of the tumori genesis process subside, the 
genetic scars of prior exposure remain in the form of clonal outgrowths and may 
explain continued lung cancer risk in ex-smokers. Future chemoprevention strategies 
need to focus on reducing the degree of chromosome instability and on trying to 
eliminate residual abnormal clonal outgrowths in the aerodigestive tract In this set- 
ting, the measurement of chromosome instability in the target tissue will be useful in 
assessing cancer risk as well as response to intervention. 
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